(57) Abstract

The present invention relates to a method for preparing a
normalized sub-divided library of amplified cDNA fragments
from the coding region of mRNA contained in a sample. It is
an object of the present invention to provide new methods and
means for investigating the expression patterns in cells,
especially in eukaryotic cells. The results of such
investigations may be used in drug development, gene
discovery, diagnosis of diseases, etc.

A METHOD TO CLONE mRNAs AND DISPLAY OF DIFFERENTIALLY EXPRESSED TRANSCRIPTS (DODET)

BACKGROUND OF THE INVENTION

The human body is comprised primarily of specialised cells
performing different physiological functions organised into
organs and tissues. All human cells contain DNA, arranged in
a series of sub-units known as genes. It is estimated that
there are approximately 100,000 genes in the human genome.
Genes are the blueprints for proteins. Proteins may perform a
wide variety of biological functions, for example as
messengers, catalysts and sensors. Such compounds are
responsible for managing most of the physiological and
biochemical functions in humans and all other living
organisms. Over the last few decades, there has been a
growing recognition that many major diseases have a genetic
basis. It is now well established that genes play an
important role in cancer, cardiovascular diseases,
psychiatric disorders, obesity, and metabolic diseases.
Significant resources are being focused on genomic research
based on the notion that the nucleotide sequences of a
particular gene and its predicted protein product will lead
to an understanding of its function in healthy and
malfunctioning cells or tissues. This understanding is
expected, in turn, to lead to therapeutic and diagnostic
approaches focused on molecular targets associated with the
gene and the protein it expresses. The first step on the way
to the development of such applications is to identify the
genes specifically involved in the different categories of
diseases. Application of this knowledge can produce new and
valuable markers, identifying regions producing major
diseases, to be used for diagnostic and therapeutic benefit.

Faced with the high complexity of the human genome, many
approaches are being used to unravel the connection between
primary gene structure and function. One well publicised
approach is embodied in the Human Genome Mapping Project,
where the sequence of all the individual genes in the entire
human genome is painstakingly being determined. At present,
however, little information can be directly retrieved on the
function of the identified genes and still less about
temporal and spatial expression patterns of the developing or
mature organism. Other approaches, such as random cDNA
sequencing, involve the sequence determination of all genes
expressed in a certain tissue, or developmental stage, of an
organism. Like a number of other strategies, this is time
consuming and prone to numerous problems.

Although the flood of data from large scale sequencing
programmes is of enormous benefit to the scientific
community, one of the major problems faced by such "shotgun"
approaches is the lack of specific information that can be
retrieved without significantly more work on the biology of
each of the individual genes.

Several other approaches have been taken by molecular
biologists to obtain more specific information on the genetic
background of particular biological processes. Such
approaches rely on a common concept. One gene, or a subset of
genes, is switched on, initiating the healthy, pathological,
or developmental status of an organ or cell type.

In a large number of experimental systems the isolation of
genes, on the basis of their differential expression, has
been applied successfully. Differential screening and
subtractive hybridisation of cDNA libraries have become well
established, cf. Zimmerman et al. (1980) and Davis et al.
(1979). Differential library screening works well in practice
for genes that are highly expressed, but mRNAs of low
abundance are difficult to isolate. Subtractive hybridisation
provides a more sensitive screening, but requires large
amounts of RNA. More recently, RNA fingerprinting methods
(often referred to as differential display or DD/RT-PCR) have
been added to these tools, offering attractive new features
for isolating genes. RNA fingerprinting methods are PCR based
and therefore do not require large amounts of RNA for
experiments. In addition, RNA fingerprinting methods allow a
large number of RNA pools to be screened for specific mRNAs
simultaneously, making it possible to investigate a wide
range of pathogenic developmental stages and their controls.
To date, two methods of RNA fingerprinting have proven useful
for isolating genes. In 1992 Liang et al. published a
protocol (US Patent 5,262,311); soon after, a protocol from
Welsh et al. (1992) was presented. Both methods begin with
cDNA synthesis from RNA using at least one arbitrary primer
for the initiation of first and second strand synthesis.

Welsh et al. (1992) designed a protocol in which the same
arbitrary 20-mer oligo is used for first and second strand
synthesis. Using arbitrary primers, only a subset of the
mRNAs is transcribed to cDNA. The cDNA pools are then used
for a standard PCR with the same primers. One of the dNTPs in
the PCR mix contains a radioactive label (35S or 32P) for
visualisation of the PCR fragments with PAGE. The Liang and
Welsh methods rely on at least one small arbitrary primer for
selection of specific cDNAs. As a consequence, annealing
temperatures are low (~40°C), and all amplified cDNA
fragments originate from a certain degree of mismatch
priming. Later, several groups produced refinements and
optimisations, leading to a plethora of articles describing
the usefulness of the method (Bauer and Warthoe et al. 1993;
Warthoe et al. 1995; Liang and Warthoe et al. 1995; Rohde and
Warthoe et al. 1996).



OBJECT OF THE INVENTION 



It is an object of the present invention to provide new
methods and means for investigating the expression patterns
in cells, especially in eukaryotic cells. The results of such
investigations may be used in drug development, gene
discovery, diagnosis of diseases etc., and therefore such
improved methods are highly desirable.

In its broadest scope, the invention pertains to a method for
preparing a sub-divided library of amplified cDNA fragments
from the coding region of mRNA contained in a sample, the
method comprising the steps of

a) subjecting the mRNA derived from the sample to reverse
transcription using at least one cDNA primer having the
general formula

5'-Con1-dT_n2-V_n3-N_n4-3'

wherein Con1 is any sequence between 1-100 nucleotides,
dT is deoxythymidinyl, V is A, G or C, N is A, G, C or T,
n2 is an integer > 1, n3 is 0 or 1, if n3 is 0 then n4 is
0, and if n3 is 1 then n4 is an integer ≥ 0, thereby
obtaining first strand cDNA fragments,

b) synthesizing second strand cDNA complementary to the
first strand cDNA fragments by use of the first strand
DNA fragments as templates, and a second cDNA primer with
the general formula

5'-Con2-N_x-3'

wherein Con2 is any sequence between 1-100 nucleotides
and can be different from or identical to Con1, N is A, G, T
or C, and x is an integer > 0, in an appropriate
enzyme/buffer solution which comprises the DNA pol I enzyme
or the Klenow fragment of the DNA pol I enzyme, all four
deoxyribonucleoside triphosphates and standard buffer and
temperature conditions, thereby obtaining double stranded
cDNA fragments,

c) subjecting the cDNA fragments obtained in step b) to a
molecular amplification procedure so as to obtain amplified
cDNA fragments, wherein is used a set of amplification
primers having the general formula

5'-Con3-N_n1-3'    (I)

wherein Con3 is a sequence identical to either Con1 or
Con2 or both, N is A, G, T or C, and n1 is an integer ≥ 0,
wherein at least one set of primers has the general
formula I where n1 > 0, said at least one set being
capable of priming amplification of any nucleotide sequence
complementary in its 5'-end to Con1 or Con2.

WO 98/51789 PCT/DK98/00186

This method is advantageous for amplifying very small amounts
of RNA. Using the method of the invention it is possible to
perform gene-profile analysis from less than 100 cells, equal
to 10^-9 gram total RNA (10 pgram RNA per cell).
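
The RNA quantity quoted above follows from simple unit arithmetic. The short Python sketch below reproduces it; the function name and the constant are illustrative assumptions, and only the figures of 100 cells and 10 pg per cell come from the text.

```python
# Figures taken from the text: about 10 pg of total RNA per cell, 100 cells.
PICOGRAM_IN_GRAMS = 1e-12


def total_rna_grams(cell_count: int, pg_per_cell: float = 10.0) -> float:
    """Total RNA mass in grams for a given number of cells (hypothetical helper)."""
    return cell_count * pg_per_cell * PICOGRAM_IN_GRAMS


# 100 cells x 10 pg per cell = 1000 pg, i.e. about one nanogram.
print(total_rna_grams(100))
```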

In a further aspect, the invention relates to a method for
preparing a sub-divided library of amplified cDNA fragments
from the coding region of mRNA (which may be of prokaryotic,
archaeal or eukaryotic origin) contained in a sample, the
method comprising the steps of

a) subjecting the mRNA derived from the sample to reverse
transcription using at least one cDNA primer, thereby
obtaining first strand cDNA fragments,

b) synthesizing second strand cDNA complementary to the
first strand cDNA fragments by use of the first strand
DNA fragments as templates, thereby obtaining double
stranded cDNA fragments,

c) digesting the double stranded cDNA fragments with at
least one restriction endonuclease, thereby obtaining
cleaved cDNA fragments,




d) ligating at least two adapter fragments to the cleaved
cDNA fragments obtained in step c), so as to obtain ligated
cDNA fragments, and

e) subjecting the ligated cDNA fragments obtained in
step d) to a molecular amplification procedure so as to
obtain amplified cDNA fragments, wherein is used, for an
adapter fragment used in step d), a set of amplification
primers having the general formula

5'-Com-N_n1-3'    (II)

wherein Com is a sequence complementary to at least the
5'-end of an adapter fragment which is ligated to the 3'-end
of a cleaved cDNA fragment, N is A, G, T, or C, and n1 is an
integer ≥ 0, and wherein at least one set of primers has the
general formula II where n1 > 0, said at least one set being
capable of priming amplification of any nucleotide sequence
ligated in its 3'-end to the adapter fragment complementary
in its 5'-end to Com.

The overall advantage of the invention compared to the prior
art is that the resulting library of cDNA fragments contains
nucleic acid sequences from all parts of cDNA which is
produced in step a). Prior art techniques which i.a. rely on
poly-dT cDNA priming have a tendency to only yield fragments
derived from the long untranslated regions of mRNA.
Furthermore, by fine-tuning of the conditions in each step,
the method of the present invention results in highly
specific reproduction of sequence information which is
present in mRNA, even in mRNA which is only present in
relatively low amounts. Furthermore, by choosing the optimum
composition of endonuclease(s) it is possible to obtain cDNA
fragments which are derived from a very large percentage of
the total number of transcribed genes in relevant cells.

The present method allows the targeted visualisation of known
genes by using primer combinations corresponding to sequences
from the gene of interest. This has the advantage that
all steps of the procedure and the biological system can
easily be verified. Also, very specific expression analyses
can be carried out on related genes with very high homology
which could not be achieved by using hybridisation
technology.

Briefly, further steps in the method of the invention involve
isolation of bands of interest from a gel, their cloning and
sequencing. The sequence information allows re-amplification
of individual bands, using primers with the appropriate 3-4
nucleotide extensions. When run on a gel, these reactions
will show one, or only a few, bands per lane, giving an
unequivocal determination of band identity.

Since the present technology makes use of end labelled
primers for visualization, the technology can be used both
with standard technologies involving radioactivity and with
fluorescent labelled primers, without the need for further
optimisation.

The invention also pertains to methods for detecting
differences between expression level(s) in cells which have
been subjected to different conditions, methods for
diagnosing disease, and methods related to "bioinformatics"
wherein are used a combination of output from the
above-disclosed method and data obtained by computer
simulation of corresponding treatment of well-defined
stretches of nucleic acids.

A separate part of the invention pertains to a novel method 
for performing reverse transcription, methods which yield 
considerably enhanced quality in the reversely transcribed 
material. Also means for carrying out this separate part of 
the invention are disclosed. 

In the following is given a short discussion of terms used in
the present application:

"A sub-divided library of amplified cDNA fragments" is in the
present context a library of amplified cDNA fragments which
is split into a number of separate pools, each pool being
defined by the sequences of the termini of the amplified
fragments. For example, one pool may contain amplified
fragments which are all characterized by having the sequence
5'-Com-AGC- in one of the strands, whereas another pool
contains amplified fragments having the sequence 5'-Com-AAT
in one of the strands. For a discussion of the meaning of
"Con" and "Com", cf. below.

"A normalised library" is a library containing substantially
equal representation of each mRNA, i.e. approximately the
same number of copies of each mRNA.

"Reverse transcription" has its usual meaning in the art,
i.e. synthesis of DNA using RNA as a template and effected by
an enzyme having reverse transcriptase activity.

"Adapter fragment" is intended to mean a nucleic acid
sequence containing a known sequence which can be used as
template for a primer in a subsequent molecular amplification
procedure such as PCR. The adapter fragment is further
characterized by its ability to become integrated at the end
of a cDNA fragment which has previously been cleaved with a
restriction endonuclease in step c). In most cases, the
restriction endonuclease leaves fragments having "sticky
ends", to which the adapter fragment will anneal readily, and
thereafter the adapter fragment becomes ligated to the cDNA
by the action of a DNA ligase.



DETAILED DISCLOSURE OF THE INVENTION 



In the following, the impact of each of the steps will be
discussed in detail, see Figures 1 and 7.

The goal in step a) is to produce a mixture of first strand
cDNA fragments which is optimized in its composition for 
carrying out the subsequent steps. A number of considerations 
apply: 

First of all, to reduce the "background noise", it is
preferred that the annealing of cDNA primer to the RNA in
step a) is performed under high stringency conditions, i.e.
at a temperature above 50°C, thereby ensuring that a minimum
of mismatches are introduced in the cDNA relative to the
mRNA.

Secondly, it is desirable to obtain copies of sequences which
are derived from all parts of mRNA in order to obtain
information relating to the translated part of the mRNA.
Prior art methods for reverse transcription of eukaryotic
material have often utilised poly-dT as cDNA primers. This
strategy has, however, the disadvantage that the most
efficiently reverse transcribed material is situated in the
untranslated part of the genes of interest. Hence, the only
parts of the mRNA which become "visible" after e.g. a PCR
procedure will very often be derived from untranslated
regions of the RNA. The reason for this is two effects. First
of all, the poly-dT approach has the consequence that the
initiation point of reverse transcription is situated very
far from e.g. the start codon relating to the operon in
question. Secondly, the mRNA may include structures (e.g.
"hairpin" structures due to intra-chain base-pairing) which
block reverse transcription, and by always initiating reverse
transcription at one terminus of a gene, such structures will
statistically block reverse transcription of a number of
translated regions.

It is in the present invention preferred to ensure that cDNAs
are produced in step a) which are representatives of the
entire gene, including the translated regions.

This can be obtained in a number of ways. If poly-dT priming
(or a variation thereof) is used, it is preferred to perform
the reverse transcription at an elevated temperature, e.g. in
the range from about 45°C to about 95°C, and to use an enzyme
having reverse transcriptase activity at said temperature.
Normally the temperature will be higher than 45°C, e.g. at
least 50°C, or even higher, e.g. at least 55°C, at least
60°C, at least 65°C or even higher, e.g. at least 70°C. This
approach has the effect that the elevated temperature ensures
that e.g. hairpin structures are "stretched out" during the
reverse transcription step, thereby avoiding the lack of
reversely transcribed fragments upstream of such structures.

Known enzymes having reverse transcriptase activity at such
elevated temperatures are enzymes selected from the group
consisting of DNA polymerases derived from thermophilic
eubacteria, such as the polymerases Taq (Thermus aquaticus),
Stoffel (Thermus aquaticus), Tth (Thermus thermophilus),
Tfl/Tub (Thermus flavus), Tru (Thermus Ruber), Tca (Thermus
caldophilus), Tfil (Thermus filiformis), Tbr (Thermus
Brockianus), Bst (B. Stearothermophilus), Bca (B. Caldotenax
YT-G), Bcav (B. Caldovelox YT-F), FjSS3-B.1 (Thermotoga
FjSS3-B.1), Tma (Thermus Maritima), UlTma (T. Maritima), Tli
(T. Litoralis), Tli exo- (T. Litoralis), 9°N-7 (Thermococcus
sp.), BG-D (Pyrococcus sp.), Pfu (P. furiosus), Pwo (P.
woesei), Sac (S. Acidocaldarius), Ssol (S. Solfataricus), Tac
(T. Acidophilum), and Mth (Methanococcus Voltae).

One minor disadvantage of using these thermostable enzymes is
that they have a tendency to be relatively ineffective
compared to the "traditional" non-thermostable reverse
transcriptases. Hence, especially if priming of the reverse
transcription is not limited to the use of poly-dT primers,
it is according to the invention possible to use
non-thermostable reverse transcriptases. Hence, in other
preferred embodiments, the reverse transcription is carried
out at a temperature in the range from about 25°C to about
55°C by use of an enzyme having reverse transcriptase
activity at said temperature. Normally the temperature will
not exceed 50°C, and usually it will be lower, such as at
most 47°C, at most 45°C, at most 43°C, at most 40°C, and at
most 35°C. The reverse transcriptase can e.g. be selected
from the group consisting of the reverse transcriptases from
AMV (Avian Myeloblastosis Virus), M-MuLV (murine M-MuLV pol
gene), and HIV-1 (HIV virus).

According to the invention, the most preferred way of
carrying out step a) is to carry out reverse transcription in
two subsequent steps, the first step comprising carrying out
reverse transcription at the temperature conditions described
above for non-thermostable enzymes, and the second step
comprising carrying out reverse transcription at the
temperature conditions described above for thermostable
enzymes. Normally this can be accomplished by having two
non-identical enzymes present in the reverse transcription
reaction, especially because the non-thermostable enzyme will
be inactivated by the increase in temperature which is
introduced when going into step 2. Of course, the enzymes can
be added for each reaction step, but it is preferred that
both enzymes are present from the start of the reaction.

It is especially preferred that the activity of the enzyme
which is active in the first step is substantially abolished
in the second step (e.g. as a consequence of temperature
denaturing of that enzyme), or expressed otherwise, that in
the second step the enzyme used in the first step is
substantially inactive. In general, it is preferred that the
enzymes used in each step are substantially more active in
the relevant temperature range than the one wherein the other
enzyme is used.

In a preferred embodiment the reaction mixture with the
sample comprises a cDNA primer, said cDNA primer being
sufficiently complementary to the target RNA present in the
sample to hybridize therewith and initiate synthesis of a
single stranded cDNA molecule complementary to said target
RNA, and the reaction mixture comprises an appropriate buffer
which comprises all four deoxyribonucleoside triphosphates
and a divalent cation selected from the group of Mg2+ and
Mn2+ in a concentration between 0.1 and 5 mM.

In fact, it is believed that the above strategy for
conducting reverse transcription by use of two enzymes having
different temperature optima, of which one has a temperature
optimum at which impeding structures in the RNA are
"stretched out", is novel and inventive in its own right.

Preferred combinations of enzymes in this embodiment of the
invention are that the enzyme effecting reverse transcription
in the first step is M-MuLV, AMV or HIV-1 and/or the enzyme
effecting reverse transcription in the second step is Tth or
Taq.

An object of the method of the invention is to obtain a
subdivision of the cDNA produced. When the mRNA is derived
from a eukaryotic system, the at least one cDNA primer may
include an oligo or poly dT tail in the 5'-end, having the
general formula 5'-dT_n2-V_n3-N_n4-3', wherein dT is
deoxythymidinyl, V is A, G, or C, N is A, G, C, or T, n2 is
an integer > 1, n3 is 0 or 1, if n3 is 0 then n4 is 0, and if
n3 is 1 then n4 is an integer ≥ 0. It will be clear that when
n3 and n4 are both zero, then the primer is an ordinary
poly- or oligo-dT cDNA primer. However, when n3 is 1, then
the primer is in fact a primer composition which will be able
to prime the reverse transcription of any mRNA having a
poly-A tail. If the original sample of RNA is subdivided, and
each sub-pool is subjected to reverse transcription which
uses one of the possible primers having the above formula
where n3 is 1, then the result is a number of single stranded
cDNA pools which are each different from each other in the
5'-end.

For example, when n3 is 1, 3 x 4^n4 groups of cDNA primers are
used, each group being distinct from any one of the other
groups with respect to the structure -V_n3-N_n4-. In such an
embodiment the pool of mRNA is conveniently subdivided into
3 x 4^n4 aliquots which are each subjected separately to step
a) utilising one of the 3 x 4^n4 groups of cDNA primers,
thereby obtaining a subdivision of the first strand cDNA into
3 x 4^n4 separate pools. Normally n4 will be 0 or 1, resulting
in the provision of 3 or 12 pools, respectively.
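
The pool count of 3 x 4^n4 can be checked by enumerating the anchored primers directly. The following Python sketch is illustrative only: the function name, the default tail length n2 = 12 and the empty Con1 default are assumptions made for the example and are not taken from the disclosure.

```python
from itertools import product


def anchored_primers(n2: int = 12, n4: int = 1, con1: str = "") -> list[str]:
    """List every 5'-Con1-dT_n2-V-N_n4-3' primer for the n3 = 1 case.

    n2 = 12 and the empty Con1 are hypothetical defaults for illustration.
    """
    primers = []
    for v in "AGC":  # V is A, G or C (never T)
        for tail in product("AGCT", repeat=n4):  # each N is A, G, C or T
            primers.append(con1 + "T" * n2 + v + "".join(tail))
    return primers


for n4 in (0, 1):
    # Prints "0 3" and then "1 12", matching the 3 or 12 pools named above.
    print(n4, len(anchored_primers(n4=n4)))
```

Each primer in the list would be used with one aliquot of the mRNA pool, so the list length equals the number of first strand cDNA pools.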

When the starting material is not eukaryotic or when it is
not the intention to necessarily set out from the part of the
transcribed gene which is most remote relative to the
translation start codon, the at least one cDNA primer does
not include a poly or oligo dT tail in the 5'-end, or,
alternatively, at least two cDNA primers are used of which at
least one includes a poly or oligo dT tail in the 5'-end and
of which at least one second does not include a poly or oligo
dT tail in the 5'-end. Preferably, the cDNA primer which does
not include a poly or oligo dT tail in the 5'-end has the
following structure

5'-N_x-TTA-3' or 5'-N_x-CTA-3' or 5'-N_x-TCA-3',

wherein N is A, G, T, or C, and x is an integer 1 ≤ x ≤ 20.
It will be clear that this corresponds to cDNA priming
setting out from any translation stop codon. As for the above
embodiments utilising a poly- or oligo-dT tailed primer, it
is, by preparing primers having all possible permutations
represented in the group N_x, possible to compose the primers
so as to correspond to any possible sequence preceding a stop
codon, thereby ensuring priming of all sequences having a
stop codon in their sequence.
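
The three primer 3'-ends given above are the reverse complements of the three translation stop codons. A minimal Python sketch of that relationship follows; the helper name is an assumption made for illustration.

```python
# Complement table for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


# The three stop codons as they appear in mRNA.
STOP_CODONS_MRNA = ("UAA", "UAG", "UGA")

# A cDNA primer anneals to the mRNA, so its 3'-end is the reverse
# complement of the codon (written as DNA, with T in place of U).
primer_3prime_ends = [
    reverse_complement(codon.replace("U", "T")) for codon in STOP_CODONS_MRNA
]
print(primer_3prime_ends)  # ['TTA', 'CTA', 'TCA']
```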



Step b)

This step is carried out by methods well known in the art. It
is, however, preferred that step b) is carried out under
conditions which minimize the formation of mismatches between
nucleotides in the first and second cDNA strands. The double
stranded cDNA procedure can be performed according to
standard methods as described in Sambrook et al. (1989).
However, since standard polymerases can have difficulty in
synthesising regions containing secondary structures or with
high GC-content, thermostable RNase H (Hybridase Thermostable
RNase H, US 5,268,289) and thermostable rBst DNA polymerase
from Bacillus stearothermophilus help overcome some of the
limitations that standard polymerases (low temperature
polymerases) suffer from.

Step c) 

In one embodiment of the invention the ligated cDNA fragments
obtained in step b) are subjected to a molecular
amplification procedure so as to obtain amplified cDNA
fragments, wherein is used a set of amplification primers
having the general formula

5'-Con3-N_n1-3'    (I)

wherein Con3 is a sequence identical to either Con1 or
Con2 or both, N is A, G, T or C, and n1 is an integer ≥ 0,
wherein at least one set of primers has the general
formula I where n1 > 0, said at least one set being
capable of priming amplification of any nucleotide sequence
complementary in its 5'-end to Con1 or Con2.
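
How primers of formula I subdivide a library can be sketched in a few lines of Python. The sketch is a simplified model under stated assumptions: perfect matching only, each fragment given as the strand that reads the same as the primer, and a made-up Con3 sequence; the function names are hypothetical.

```python
from itertools import product
from typing import Optional


def selective_primers(con3: str, n1: int) -> list[str]:
    """All 4^n1 primers 5'-Con3-N_n1-3' for a given constant part Con3."""
    return [con3 + "".join(ext) for ext in product("ACGT", repeat=n1)]


def matching_primer(fragment: str, con3: str, n1: int) -> Optional[str]:
    """Return the one primer whose extension matches the fragment, or None.

    The fragment is the strand that starts with Con3 (same sense as the
    primer); perfect matching is assumed.
    """
    if not fragment.startswith(con3) or len(fragment) < len(con3) + n1:
        return None
    return fragment[: len(con3) + n1]


CON3 = "GACTGCGTACC"  # made-up constant sequence, for illustration only
fragments = [CON3 + "AGTTTGCA", CON3 + "CTGGGATT", "TTTTTTTTTT"]
for frag in fragments:
    print(matching_primer(frag, CON3, 2))
```

With n1 = 2 there are 16 primers, so fragments carrying Con3 are distributed over up to 16 separate amplification reactions, while a fragment lacking Con3 matches none of them.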

In another embodiment, after the preparation and optional
subdivision of the mRNA, each of the different pools of cDNA
is digested with at least one restriction enzyme to produce
fragments of a size which can be separated using an
appropriate size fractionation method.

The choice of restriction enzyme is based largely on the
frequency of the cleavage sites in a given cDNA pool. Too
many cleavage sites in each cDNA fragment will result in too
small fragments, and vice versa. Optimally, the at least one
enzyme should cleave every cDNA and yield fragments of the
desired size. Statistically, it is not possible to cleave
every cDNA, but on the other hand a very large percentage can
be cleaved by choosing a suitable enzyme or combination of
enzymes. It is preferred that the method of the invention
utilises at least one restriction enzyme chosen so as to
ensure that at least 60% of cDNAs are cleaved, but higher
percentages such as at least 65%, at least 70%, at least 75%,
at least 80%, or even at least 85% are more preferred.
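
A rough feel for these percentages can be obtained from a simple random-sequence model. The Python sketch below assumes equal base frequencies and treats each position as independent; real cDNA deviates from both assumptions, so the numbers are indicative only, and the function name is hypothetical.

```python
def fraction_cleaved(cdna_length: int, site_length: int = 4) -> float:
    """Probability that a random sequence holds at least one given site.

    Assumes equal base frequencies and independent positions, which is
    only an approximation for real cDNA.
    """
    positions = max(cdna_length - site_length + 1, 0)
    p_site = 0.25 ** site_length  # 1/256 for a 4 base site
    return 1.0 - (1.0 - p_site) ** positions


for length in (250, 500, 1000, 2000):
    # Columns: cDNA length, 4 base cutter, 6 base cutter.
    print(length,
          round(fraction_cleaved(length, 4), 2),
          round(fraction_cleaved(length, 6), 2))
```

Under this model a 4 base cutter cleaves most cDNAs of typical length, whereas a 6 base cutter leaves a substantial share uncut.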

Preferably the invention should use restriction enzymes that
leave protruding ends (sticky ends) at the termini of the DNA
after digestion in step c), since this greatly facilitates
the introduction of the adapter fragments in step d).

As will appear from the above, the frequency with which the
restriction endonuclease cleaves is important. The at least
one restriction enzyme is preferably chosen so as to cleave
each complete cDNA into an average of about 3 fragments. It
will be understood that some cDNAs obtained from preceding
steps will not be cut at all (although this is a rare
incidence when the restriction enzyme(s) is/are carefully
chosen) whereas others will be cut with a high frequency. It
has turned out that use of a rare 4 base cutter as the at
least one restriction endonuclease (such as the 4 base
cutters AciI, AluI, BfaI, BstUI, Csp6I, DpnI, DpnII, HaeIII,
HhaI, HinP1I, HpaII, MboI, MnlI, MseI, MspI, NlaIII, RsaI,
Sau3AI, TaiI, TaqI, and Tsp509I) ensures the optimum
performance of the inventive method. By use of such a rare 4
base cutter, the use of only 1 restriction enzyme in step c)
is sufficient and results in superior output.
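
The effect of a 4 base cutter on a given cDNA can be previewed with an in-silico digest. The Python sketch below is a simplified illustration: it considers only the top strand, covers three of the listed enzymes, and uses a made-up test sequence; the function and table names are hypothetical.

```python
import re

# Recognition sequence and top-strand cut offset for three listed enzymes:
# MboI cuts before GATC, TaqI cuts T^CGA, MseI cuts T^TAA.
SITES = {"MboI": ("GATC", 0), "TaqI": ("TCGA", 1), "MseI": ("TTAA", 1)}


def digest(cdna: str, enzyme: str) -> list[str]:
    """Cut the top strand of cdna at every site of the chosen enzyme."""
    site, offset = SITES[enzyme]
    cuts = [m.start() + offset for m in re.finditer(f"(?={site})", cdna)]
    bounds = [0] + [c for c in cuts if 0 < c < len(cdna)] + [len(cdna)]
    return [cdna[a:b] for a, b in zip(bounds, bounds[1:])]


cdna = "AAAGATCCCCTTAAGGGGATCTT"  # made-up sequence for illustration
print(digest(cdna, "MboI"))  # ['AAA', 'GATCCCCTTAAGGG', 'GATCTT']
print(digest(cdna, "MseI"))  # ['AAAGATCCCCT', 'TAAGGGGATCTT']
```

Running such a digest over a set of known cDNA sequences gives the number of fragments per cDNA, which can be compared with the target of about 3 fragments stated above.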

Alternatively, a combination of restriction endonucleases can
be used wherein a balance of e.g. 6 base cutters and 4 base
cutters ensures a reasonable distribution of fragment sizes.
For instance the use of a first restriction enzyme (e.g. a 6
base cutter) which statistically cleaves at least 20% of
complete cDNA derived from the mRNA sample into two
subfragments, and of a second restriction enzyme (e.g. a 4
base cutter) which statistically cleaves at least 50% of said
subfragments into 3 further subfragments, will also result in
a series of fragments suitable for later size fractionation.



Step d)



The mixture(s) obtained in step c) are then subjected to a
reaction wherein adapter fragments are added to both ends of
the double stranded cDNA fragments obtained. As mentioned
above, this part of the procedure is greatly facilitated by
the cleaved cDNA fragments having protruding "sticky" ends,
because pre-designed adapter fragments which fit to these
protruding ends can easily be prepared.

The adapter (or anchor) fragments are added to the cleaved
fragments in order to obtain "order in chaos" in the
subsequent step. By adding known sequences to the termini of
the cleaved fragments, one creates targets for specific
amplification primers which can be designed specifically with
the aim of amplifying sequences complying to the adapter
fragments. The material thus obtained (primary template) can
be pre-amplified, using primers complementary to the ligated
adaptor sequences, giving rise to secondary template. The
pre-amplification of primary template allows virtually
unlimited amounts of template to be produced from one RNA
preparation, avoiding the need for repeated isolations.

The adaptor sequence is thus selected so as to serve as the
starting point for DNA polymerisation in e.g. a PCR reaction.
The adaptor sequences are constructed in such a way that the
specific endonuclease sites are not regenerated after
ligation of said adaptor.

In a preferred embodiment at least one termination fragment
is also ligated to the 3'-end of single strands of cleaved
cDNA fragments, said at least one termination fragment
introducing a block against DNA polymerization in the 5'→3'
direction setting out from the at least one termination
fragment and said at least one termination fragment being
unable to anneal to any primer of the at least two primer
sets in step e) during the molecular amplification procedure.

The above is a very important procedure when combined with
the use of detection effected by labelled primers in the
amplification step, wherein only one member of the pair of
primers is labelled whereas the other is designed to split up
the amplified products according to their base composition
adjacent to the adapter fragment. One important feature is
that a single stranded cDNA fragment which has been provided
with a termination fragment will not be amplified, because no
primers will be able to anneal to the products of a first
round polymerisation wherein such a fragment was the
template, see Figure 7. Secondly, the approach opens for the
possibility of removing background "noise" in a subsequent
detection phase.

Normally, the at least one termination fragment comprises or
is a chemically modified nucleotide sequence, such as for
instance a nucleotide sequence which comprises a
dideoxynucleotide in the 3'-end; this termination technique
is well-known from e.g. the chain-termination sequencing
technique according to Sanger. Under normal circumstances,
the dideoxynucleotide should, according to the invention, be
covalently attached to the nucleotide strand so as to avoid
loss of the dideoxynucleotide during subsequent rounds of
amplification. Superior stabilisation is attained if the
dideoxynucleotide is phosphorylated.

As mentioned above, the ligation of adapter and/or termina- 
tion fragments to the cleaved cDNA fragments in step d) is 
conveniently achieved by annealing the adapter fragments to 
sticky ends of the cDNA resulting from the cleavage in step 
c) and subjecting the product to the action of an enzyme 
having DNA ligase activity. Any suitable DNA ligase known in
the art can be used.

Step e)



Step e) of the method of the invention results in the final
sorting of the modified cDNA fragments from step d). As steps
b), c) and d) are combined in the broadest embodiment of the
invention, step e) corresponds to step c) of this embodiment
described above.



The primers having the structure of formula I (step c) or II
(step e) are designed so as to selectively amplify
synthesized double stranded cDNA fragments obtained in step
b) or predefined subsets of the adapted fragments obtained in
step d). A number of ways this can be done may be envisaged,
but the main strategy is to prime amplification in a series
of separate reactions where the nucleotide sequence of one
primer in one reaction ensures that the amplified products of
that reaction are different from those obtained from any of
the other reactions and that all the reactions together
result in amplification of all fragments obtained from step
b) or d), respectively.

Even though the at least one set of amplification primers of
formula I or II has an nl which is > 0, it is preferred that
nl = 1, nl = 2, nl = 3, or nl = 4 in one of the primers,
because the number of primer fragments to be used in the
reactions in order to cover all possible nucleotide stretches
adjacent to the Con or adapter fragment is then easily
manageable. For instance, if nl = 5, it would be necessary to
use 4^5 = 1024 different primers in order to obtain
amplification of all possible nucleotide sequences adjacent
to the relevant adapter fragment, and since the preferred
embodiment of the invention requires that each such primer is
used in a separate reaction, the work involved would be
problematic.

It is also preferred that in one of the primers n=0, and it
is especially preferred that this primer is labelled, in
order to facilitate determination of the amplified fragments.

Hence, in the most preferred embodiments of the invention,
the adapted cDNA fragments are amplified in a number of
separate reactions wherein a labelled primer is used (which
is normally identical in all reactions) and at least one
non-labelled primer which is a member of the set of primers
described above where n≥1. It is preferred that this set of
amplification primers of formula I or II wherein nl>0
comprises all possible combinations and permutations of A, G,
T, and C in the group N_nl, since this will ensure that all
possible cDNA fragments can be amplified by the set.

Hence, the ligated cDNA fragments are sub-divided into a
number of pools prior to the molecular amplification in step
e), and each pool is subjected to the amplification using a
subset of the set of amplification primers, and in the most
preferred embodiment the ligated cDNA fragments of step d)
are subdivided into 4^nl pools which are each subjected
separately to step e) wherein is used one labelled
amplification primer as described above (nl=0) and one primer
from the set of amplification primers as defined above (n>0),
said one primer being distinct from any one of the primers
used for amplifying any of the other pools. By using this
approach, the originally reverse transcribed and cleaved cDNA
fragments are subdivided into 4^nl pools which can each be
subjected to further steps.
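
The growth of the primer set with the number of selective
bases can be illustrated with a short sketch. It is not part
of the disclosed method; the function name selective_primers
is hypothetical, and the 16-nucleotide core is simply taken
from one of the amplification primers listed in Example 1.

```python
from itertools import product


def selective_primers(core, n):
    """Return all 4**n primers formed by appending n selective bases to core."""
    return [core + "".join(bases) for bases in product("ACGT", repeat=n)]


# Core of one amplification primer quoted in Example 1 (written 5'->3').
PRIMER_CORE = "GACTGCGTACCGATCA"

one_base = selective_primers(PRIMER_CORE, 1)   # one primer per pool, 4 pools
three_bases = selective_primers(PRIMER_CORE, 3)
five_bases = selective_primers(PRIMER_CORE, 5)

print(one_base)
print(len(three_bases), len(five_bases))
```

With one selective base the set has four members, with three
it has 64, and with five it has 1024, which is the figure
given above as impractical when every primer needs its own
reaction.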



Further steps and applications 



The material obtained from the above-described series of
reactions can now be utilised in a number of ways. Normally,
a further step of separating amplified fragments obtained
from the molecular amplification procedure is performed. This
yields a mixture of amplified fragments which are separated
e.g. by size separation, by mobility in a gel electrophoresis
or by any suitable chromatographic method. Furthermore, a
step of identification (e.g. by visualization of these
separated fragments) is normally carried out for
"book-keeping purposes"; the separated mixture of fragments
will normally be compared to some kind of reference which may
be material derived from the same or another cell type.

Visualization of the separated fragments can, as mentioned 
above, be achieved by one of the primers in the amplification 
reaction being labelled, but other methods are of course 
available. For instance, a specifically labelled probe which 
e.g. binds to one of the adapter sequences will visualise the 
fragments, but also labelled nucleotides which have been
incorporated in the fragments during the amplification
procedure (e.g. a PCR) will of course be a suitable means for
detection (e.g. by incorporating radioactive or fluorescent
alpha dNTP into the cDNA fragment during PCR, where N = A, C,
T, U or G).

However, it is preferred that visualisation of specific RNA
Derived Fragments (RDFs) is achieved using primers which are
radioactively or fluorescently labelled and are homologous to
the adaptors. The comparatively high annealing temperatures
(touch-down from 65°C to 56°C) which are preferably used
ensure that polymerisation events will predominantly
originate from perfect priming of adapter sequences and
adjacent selective bases. Band intensities are largely a
function of initial template concentration, whereas band
intensities of the original Differential Display methods are
dependent on the quality of the match between the individual
template and primer. The visualisation of rare mRNAs using
the present inventive methods will therefore be less hampered
by the over-representation of signal from highly abundant
mRNAs. In the case of arbitrary priming, by contrast,
mismatch amplification of abundant RDFs always out-competes
the amplification of rare fragments that base pair perfectly.
Our experiments suggest that as few as 100 molecules can be
routinely detected in a given template. This corresponds to
less than 1 transcript per cell in the original tissue.

One interesting part of the invention relates to the use of
the above-described methods in bioinformatics. In short,
known DNA sequences are inputted into a computer database,
and on the basis of such sequences a comparison with a
real-life run of the above-described methods can be
performed. In this way, bands in a gel obtained from the
methods of the invention can be unambiguously identified with
respect to sequence, origin and even functionality. Hence,
this part of the invention pertains to a method for
determining the presence of an expression product in a cell
or group of cells, the method comprising providing an
RNA-containing sample from the cell or group of cells and
subjecting the sample to the method described above, and
thereafter performing a comparison of the thus identified
amplified cDNA fragments with a database output, said
database output comprising a computer-generated list of
molecular weights of restriction DNA fragments of known
sequences, said list being prepared by

inputting and storing DNA sequence data in a database as
virtual DNA sequences (these can be obtained and updated
regularly from any database containing information about
gene sequences from the relevant organism or cell type),

subsequently simulating cleavage of the virtual DNA
sequences with the at least one restriction nuclease and
storing the resulting simulated cleavage products as
virtual cleaved DNA fragments (such simulation is
relatively uncomplicated, since the recognition and cleavage
patterns of a large number of restriction enzymes are
already known),

simulating ligation to the virtually cleaved fragments of
the at least two adapter fragments and storing the results
as virtually ligated DNA fragments (again, this merely
requires that input is provided of the structure of adapter
fragments used in the real-life process),

for each individual combination of primers used in step e)
grouping the virtually ligated DNA fragments susceptible to
amplification by said combination of primers in the same
group,

determining, in each group, the absolute and/or relative
molecular weight of each virtually ligated DNA fragment,
and

outputting the content of each group in the form of a
list comprising the absolute and/or relative molecular
weights of the virtually ligated fragments in the group.
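
The simulation steps listed above can be sketched in a few
lines. The sketch below is an illustration only and makes
several assumptions that are not stated in the text: the two
enzymes are taken to be TaqI (T^CGA) and BclI (T^GATCA) as in
Example 1, only fragments bounded by two different sites are
kept, one selective base is read at each end, and fragment
length is reported in nucleotides of the insert (without
adapters) as a stand-in for molecular weight. All function
and variable names are hypothetical.

```python
from collections import defaultdict

# Recognition sequences; both enzymes cut after the first T of the site.
SITES = {"TaqI": "TCGA", "BclI": "TGATCA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def find_sites(seq):
    """Return sorted (start, enzyme) pairs for every recognition site."""
    hits = []
    for enzyme, motif in SITES.items():
        start = seq.find(motif)
        while start != -1:
            hits.append((start, enzyme))
            start = seq.find(motif, start + 1)
    return sorted(hits)


def virtual_lanes(sequences):
    """Group simulated fragments by enzyme and selective base at each end."""
    lanes = defaultdict(list)
    for origin, seq in sequences.items():
        sites = find_sites(seq)
        for (s1, e1), (s2, e2) in zip(sites, sites[1:]):
            inner_start = s1 + len(SITES[e1])
            if e1 == e2 or inner_start >= s2:
                continue  # same enzyme at both ends, or sites abut/overlap
            left_base = seq[inner_start]  # first base after the left site
            # The right-hand primer reads the opposite strand, so its
            # selective base is the complement of the base before the site.
            right_base = seq[s2 - 1].translate(COMPLEMENT)
            lanes[(e1, left_base, e2, right_base)].append((s2 - s1, origin))
    return {group: sorted(frags) for group, frags in lanes.items()}


# Hypothetical toy "database" of two virtual sequences.
database = {
    "geneA": "AAATCGAGGGTGATCACCC",
    "geneB": "TTGATCATTTTTCGAGG",
}
lanes = virtual_lanes(database)
for group, frags in lanes.items():
    print(group, frags)
```

Each key of the returned dictionary corresponds to one primer
combination, and each value is the list of fragment lengths
expected in that reaction together with a pointer back to the
sequence of origin.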

It is preferred that a link is maintained between each member 
of the output list and the original sequence from which such 
a member has been derived. This can e.g. be done by linking 
the input DNA sequence data to data relating to the genetic
origin of the DNA sequence data and optionally to data
relating to functional features relating to the genetic origin
and thereafter maintaining the information as a pointer back
in the system to said sequence. Hence, the output indication 
will conveniently further comprise information about the 
genetic origin of the virtually ligated DNA fragment and 
optionally information about functional features associated 
with the genetic origin. 

For ease of use of such a bioinformatic system, it is
normally necessary that 1) either the comparison is performed
by inputting the identified amplified cDNA fragments in a
format which allows automated comparison with the database
output, or 2) the database output is outputted in a format
which allows for direct comparison between the separated
amplified cDNA fragments and the database output. For
instance, if the visualized and separated cDNA fragments from
step e) have been run on a gel, it will be possible either to
read a digital reproduction of the gel pattern into the
computer and let the computer compare this input with the
computer generated pattern, or alternatively, to output the
computer generated pattern in such a manner that it resembles
an electrophoresis gel pattern.
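
The automated comparison of option 1) might look as follows.
This is a minimal sketch under stated assumptions: band sizes
have already been read from the gel as lengths in base pairs,
the predicted list is a list of (length, origin) pairs, and a
fixed tolerance in base pairs is acceptable. The function
name match_bands, the tolerance value and the example data
are all hypothetical.

```python
def match_bands(observed_bp, predicted, tolerance_bp=2):
    """Map each observed band size to the predicted fragments within tolerance."""
    matches = {}
    for size in observed_bp:
        matches[size] = [
            (length, origin)
            for length, origin in predicted
            if abs(length - size) <= tolerance_bp
        ]
    return matches


# Hypothetical predicted fragments for one primer combination.
predicted = [(152, "gene A"), (153, "gene B"), (310, "gene C")]
observed = [152, 240, 311]

result = match_bands(observed, predicted)
for size, candidates in result.items():
    print(size, candidates)
```

A band with several candidates (152 bp here) would need a
further primer combination or sequencing to be resolved, and
a band with no candidate (240 bp) points to a sequence not
yet in the database.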



Another part of the invention pertains to the use of the
inventive method for comparing expression levels in different
cells. One way of doing this is to determine the change in
expression, compared to the expression in a reference cell or
reference group of cells, of an expression product in a cell
or group of cells which has been subjected to a first set of
conditions influencing the expression pattern of said cell or
group of cells, said reference cell or group of cells being
subjected to a second set of conditions, the method
comprising providing an RNA-containing sample from the cell
or group of cells and subjecting the sample to the method of
the invention for sub-division, thereby obtaining data
describing the amplified cDNA fragments derived from the
sample, providing reference data describing amplified cDNA
fragments derived from an RNA-containing reference sample
from the reference cell or reference group of cells, the
reference data being obtained by having previously subjected
the reference sample to the method of the invention,
subsequently performing a comparison of the data and the
reference data to identify the cDNA fragments which are
expressed at different levels in the two data sets, and
thereafter using the differentially expressed cDNA fragments
to determine which expression products are subject to a
change in expression level. In other words, the method of the
invention is carried out twice on the basis of two different
RNA samples derived from cells subjected to differing
conditions.

Normally, the data and reference data are selected from the
group consisting of the apparent molecular weights of the
amplified DNA fragments, the M_r of the amplified DNA
fragments, the absolute amount of the amplified DNA
fragments, and the relative amounts of the amplified DNA
fragments. The
reference data can further be extracted from a database 
containing the reference data defined above and optionally 
further information relating to the genetic origin of each 
amplified cDNA fragment from the reference. 
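
The comparison of data and reference data can be sketched as
below. The sketch assumes that each data set has been reduced
to a mapping from a band, identified by its selective bases
and apparent size in bp, to a relative amount, and that a
fixed fold-change threshold decides what counts as a
difference. The threshold, the function name
differential_bands and the numbers are hypothetical.

```python
def differential_bands(sample, reference, min_fold=2.0):
    """Return bands whose relative amount differs by at least min_fold."""
    changes = {}
    for band in sorted(set(sample) | set(reference)):
        s = sample.get(band, 0.0)
        r = reference.get(band, 0.0)
        if s == r:
            continue  # identical amounts, including both absent
        if r == 0.0:
            changes[band] = "appeared"
        elif s == 0.0:
            changes[band] = "disappeared"
        elif s / r >= min_fold:
            changes[band] = "up"
        elif r / s >= min_fold:
            changes[band] = "down"
    return changes


# Hypothetical relative amounts keyed by (selective bases, size in bp).
sample = {("GAA", 150): 2.0, ("GAC", 180): 1.0,
          ("GAG", 220): 4.0, ("GCA", 300): 0.5}
reference = {("GAA", 150): 2.1, ("GAC", 180): 3.0, ("GAG", 220): 1.0}

changes = differential_bands(sample, reference)
print(changes)
```

Bands that change by less than the threshold are left out of
the result, so only candidate differentially expressed
fragments are passed on for identification.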

Related to the above, the invention also allows for diagnosis
of disease which is characterized by a deviating (increased
or reduced) expression level of at least one expression
product in at least one cell type, the method comprising
providing an RNA-containing sample derived from the at least
one cell type, subjecting the sample to the method of the
invention thereby obtaining data describing the amplified
cDNA fragments derived from the sample, providing reference
data describing amplified cDNA fragments derived from an
RNA-containing reference sample derived from the same type of
cell from a subject not suffering from the disease, the
reference data being obtained by having previously subjected
the reference sample to the method according to the
invention, and subsequently performing a comparison of the
data and the reference data with respect to those cDNA
fragments which are known to be related to the disease, and
assessing whether a significant difference in the data and
reference data exists so as to establish whether the
expression level of the expression product deviates or not.

As for the embodiment above, also here the data and reference
data are selected from the group consisting of the apparent
molecular weights of the amplified DNA fragments, the M_r of
the amplified DNA fragments, the absolute amount of the
amplified DNA fragments, and the relative amounts of the
amplified DNA fragments, and also here the reference data can
be extracted from a database containing the reference data
defined above and optionally further information relating to
the genetic origin of each amplified cDNA fragment from the
reference.

Further, the invention provides a method for treatment of a
disease which is characterized by a deviating (increased or
reduced) expression level of at least one expression product
in at least one cell type, the method comprising providing an
RNA-containing sample derived from the at least one cell
type, subjecting the sample to the method of the invention
thereby obtaining data describing the amplified cDNA
fragments derived from the sample, providing reference data
describing amplified cDNA fragments derived from an
RNA-containing reference sample derived from the same type of
cell from a subject not suffering from the disease, the
reference data being obtained by having previously subjected
the reference sample to the method according to the
invention, and subsequently performing a comparison of the
data and the reference data with respect to those cDNA
fragments which are known to be or suspected of being related
to the disease, and assessing whether a significant
difference in the data and reference data exists so as to
establish whether the expression level of the expression
product deviates or not.

If the expression product is reduced, the disease may be
treated by delivering the expression product; if the
expression product is increased, the disease may be treated
by delivering an inhibitor (e.g. an antibody) against the
expression product. The scope of the present invention
includes an expression product identified by the method of
the invention as such as well as methods for treating a
disease which method has been provided by means of the method
of the invention.

The mixtures of amplified fragments obtained from step e) of 
the method of the invention may also be used for preparing a 
surface (chip) coated with cDNA fragments. This can be done 
by 

subjecting an RNA-containing sample to the subdivision
method of the invention including separation steps, and

transferring the separated amplified cDNA fragments to a
chip surface adapted to stably bind the separated amplified
cDNA fragments while maintaining the spatial relative
distribution pattern thereof.

Alternatively, such a chip can be prepared by

subjecting an RNA-containing sample to the method of the
invention without performing the separation, and thereafter

separating, by electrophoresis, the thus obtained amplified
cDNA fragments on a particular surface adapted to stably bind
the separated amplified cDNA fragments while maintaining the
relative distribution pattern after electrophoresis. In this
embodiment, the electrophoresis is preferably in the form of
microelectrophoresis.

Transfer to the surface is preferably accomplished by an
electrophoretic blotting technique, and/or by well-known
photo-activated organic or inorganic chemistry coupling
techniques.

The invention also pertains to a surface obtainable by the
above-mentioned method for the preparation thereof. Such
surfaces are considered novel and inventive, since known "DNA
chips" rely on specific introduction of an array of nucleic
acid fragments of known structure, whereas the present method
provides for "a semi-array" containing cDNA fragments
characterizing a specific "situation" for a specific cell
type, according to Figure 3.

Such a surface can i.a. be used for screening for genes
within a gene family. The "array chip" is provided and
thereafter a labelled probe (which is a representative of a
gene family) is allowed to hybridize to the chip under low
stringency, i.e. under conditions as described at pages
94-106 in "Nucleic acid hybridisation. A practical approach"
edited by B D Hames & S J Higgins, IRL Press. A number of
fragments coupled to the chip will hybridize to the probe,
and these fragments can subsequently be identified, isolated
and sequenced/characterized in order to determine whether
they are representatives of the same gene family.

Another use of such "semi-arrays" is for determining the
difference in expression pattern between a first cell or type
of cells and a second cell or type of cells, the method
comprising providing samples of labelled RNA or cDNA from the
first and second cells or cell types and subsequently
contacting each of these samples with a chip surface as
described above, and subsequently detecting the amount and
distribution of bound labelled RNA or cDNA from each sample.

Under all circumstances, the chip surface with the cDNA bound 
thereto can e.g. be produced by the methods described in EP-0 
654 061. 

Yet another part of the invention pertains to a method for
screening for interactions between a pre-selected protein and
a polypeptide fragment, the method comprising preparing a
sub-divided library of amplified cDNA fragments resulting
from step e), optionally adapting the terminals of the
members of the library so as to facilitate insertion in a
vector, inserting the fragments into vectors, transforming a
population of suitable host cells with the vectors, culturing
the host cells under conditions which enable expression of
correctly inserted cDNA fragments by the host cell, and
subsequently assaying polypeptide fragments encoded by the
inserted cDNA fragments for interaction with the pre-selected
protein.

One convenient way of achieving this is by way of a
two-hybrid technique, wherein the host cells are eukaryotic
cells (such as fungal cells, especially yeast cells) which
are mated or transfected with nucleic acid material encoding
the pre-selected protein, successful mating/transfection of
the cell(s) resulting in a cell or cells wherein the
interaction between the pre-selected protein and a
polypeptide fragment gives rise to a detectable signal.

Such methods have recently attracted a great deal of
attention, i.a. as a consequence of the disclosure in
Fromont-Racine et al., Nature Genetics 16, 277-282 (1997),
which is incorporated by reference herein.

One convenient system for providing the detectable signal is
by use of Green Fluorescent Protein, disclosed in EP-A-0 569
170, wherein changes in fluorescent spectrum due to
interactions are used as reporter.

Finally, the invention pertains to a composition for use in 
reverse transcription of RNA, the composition comprising 

a) a first enzyme having reverse transcriptase activity 
at temperatures not exceeding 55°C 

b) a second enzyme having reverse transcriptase activity 
at elevated temperatures in the range of 45°C - 95°C (and 
especially the temperatures discussed above for perform- 
ing reverse transcription at elevated temperatures) , 

said second enzyme having a substantially higher activity
than said first enzyme in catalyzing reverse transcription at
said elevated temperatures. It is preferred that the first
enzyme has a substantially higher activity than said second
enzyme in catalyzing reverse transcription at said
temperatures not exceeding 55°C, and it is also preferred
that the second enzyme has a substantially higher activity
than said first enzyme in catalyzing reverse transcription at
said temperatures exceeding 45°C.

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

First, the drawing will be briefly described. 

Fig. 1

Basis of Display of Differentially Expressed Transcripts.

Fig. 2

Anchor and PCR primer design.

Fig. 3

An autoradiogram of a DODET gel using the cellular set-up
described in Example 1; rat pheochromocytoma PC12 cells were
stimulated with Nerve Growth Factor (NGF) and Epidermal
Growth Factor (EGF).

Lanes 1-24, reverse transcription using the anchored poly T
primer 5'-T25AA-3'

Lanes 25-48, reverse transcription using the anchored poly T
primer 5'-T25GC-3'

Lanes 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41 and 45
represent the PC12 cells not treated.

Lanes 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42 and 46
represent the PC12 cells treated with the NGF factor for 60
minutes.

Lanes 3, 7, 11, 15, 19, 23, 27, 31, 35, 39, 43 and 47
represent the PC12 cells treated with the NGF factor for 90
minutes.

Lanes 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44 and 48
represent the PC12 cells treated with the EGF factor for 90
minutes.

Lanes 1-48, using the following pairs for the pre-PCR
amplifications:

TaqI pre-amplification primer: 5'-CAGCATGAGTCCTGACCGA
BclI pre-amplification primer: 5'-CTCGTAGACTGCGTACCGATCA

For the second PCR amplification the following primer pairs
were used:

Lanes 1-4
5'-CATGAGTCCTGACCGAA
5'-GACTGCGTACCGATCAA (5' end labelling)

Lanes 5-8
5'-CATGAGTCCTGACCGAA
5'-GACTGCGTACCGATCAC (5' end labelling)

Lanes 9-12
5'-CATGAGTCCTGACCGAA
5'-GACTGCGTACCGATCAC (5' end labelling)

Lanes 13-16
5'-CATGAGTCCTGACCGAA
5'-GACTGCGTACCGATCAT (5' end labelling)

Lanes 17-20
5'-CATGAGTCCTGACCGAC
5'-GACTGCGTACCGATCAA (5' end labelling)

Lanes 21-24
5'-CATGAGTCCTGACCGAC
5'-GACTGCGTACCGATCAC (5' end labelling)

Lanes 25-48
Repeated primer combinations from lanes 1-24
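
The lane layout of Fig. 3 is regular, so a lane number can be
decoded arithmetically. The helper below is an illustrative
sketch only; the function name describe_lane is hypothetical,
and primer pairs are simply numbered 1 to 6 in the order in
which they are listed above.

```python
CONDITIONS = ["untreated", "NGF 60 min", "NGF 90 min", "EGF 90 min"]
RT_PRIMERS = ["5'-T25AA-3'", "5'-T25GC-3'"]


def describe_lane(lane):
    """Return (RT primer, primer pair number 1-6, treatment) for a lane 1-48."""
    if not 1 <= lane <= 48:
        raise ValueError("lane must be between 1 and 48")
    index = lane - 1
    rt_primer = RT_PRIMERS[index // 24]      # lanes 1-24 vs 25-48
    primer_pair = (index % 24) // 4 + 1      # blocks of four lanes
    condition = CONDITIONS[index % 4]        # repeats every four lanes
    return rt_primer, primer_pair, condition


for lane in (1, 19, 26, 48):
    print(lane, describe_lane(lane))
```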
Fig. 4a

Northern Blot of RDF01 sequence from cellular total RNA.
a) PC12 cells not treated, b) NGF treatment for 60 minutes,
c) NGF treatment for 90 minutes, and d) EGF treatment for 90
minutes.

Fig. 4b

Loading control. RNA extracts were electrophoresed on a 1.2%
agarose gel containing ethidium bromide, used as a control to
determine the relative concentration of RNA in each lane; a,
b, c, d same as in Figure 4a.

Fig. 4c

Northern Blot of RDF02 sequence from cellular total RNA.
a) PC12 cells not treated, b) NGF treatment for 60 minutes,
c) NGF treatment for 90 minutes, and d) EGF treatment for 90
minutes.

Fig. 4d

Loading control. RNA extracts were electrophoresed on a 1.2%
agarose gel containing ethidium bromide, used as a control to
determine the relative amount of RNA in each lane; a, b, c, d.

Fig. 5

Searching for genes modulated by a growth factor.

Lane 1       Size marker in bp (150 bp, 200 bp, 250 bp).
Lanes 2-6    Amplification primer 5'-Com-N_nl-3' where N_nl is GAA
Lanes 7-11   Amplification primer 5'-Com-N_nl-3' where N_nl is GAC
Lanes 12-16  Amplification primer 5'-Com-N_nl-3' where N_nl is GAG
Lanes 17-21  Amplification primer 5'-Com-N_nl-3' where N_nl is GAT
Lanes 22-26  Amplification primer 5'-Com-N_nl-3' where N_nl is GCA

In lane 11 a downregulation is observed after 6 days
treatment, whereas in lane 16 an upregulation is observed
after 6 days treatment. Both modulations are due to the
growth factor, since regulation is seen only when the active
growth factor is present.

Fig. 6

Searching for genes involved in bacterial resistance.

Lane 1       Size marker in bp (150 bp, 200 bp, 250 bp, 300 bp).
Lanes 2-5    Amplification primer 5'-Com-N_nl-3' where N_nl is GAA
Lanes 6-9    Amplification primer 5'-Com-N_nl-3' where N_nl is GAC
Lanes 10-13  Amplification primer 5'-Com-N_nl-3' where N_nl is GAG
Lanes 14-17  Amplification primer 5'-Com-N_nl-3' where N_nl is GAT
Lanes 18-21  Amplification primer 5'-Com-N_nl-3' where N_nl is GCA
Lanes 22-25  Amplification primer 5'-Com-N_nl-3' where N_nl is GCC

In lanes 8-9 a downregulation is observed, in lanes 20-21 an
upregulation is observed. Both gene modulations are potential
genes involved in the resistance to the Bacteriamycin,
Inosin.



Fig. 7 

Principle of the technology used in Examples 4 and 5. 

After ds-cDNA synthesis the DNA is digested with one 4 base
pair endonuclease and anchors are ligated to the ds-cDNA
ends. Using specially designed primers the expression
profiles are obtained by amplifying the mRNAs in different
expression windows (sub-fractions). The number of expression
windows depends on the complexity of the sample, i.e. 64
expression windows in eukaryotes.



Fig. 8

Principles of a gene discovery DNA surface (a DNA chip).

After size separation of the DNA fragments, the DNA fragments
are transferred to a nylon membrane using an electrophoretic
principle. The membrane is hybridized with a complex DNA
probe generated using the principle of the invention.
Alternatively the membrane can be hybridized with one single
gene to identify new members of a particular gene family. The
membrane is separated along the x coordinate into 64
expression windows, and along the y coordinate by base pair
size (from 50 base pairs to 1200 base pairs) according to the
principle described in Figure 7.



Fig. 9

Principle of generating 64 pools of 3'-end cDNAs.

Step 1

Production of single stranded cDNA using a 5'-con1-TnV
oligonucleotide, where con1 is an oligonucleotide of between
1 and 100 nucleotides, n is between 5 and 40 and V is a
mixture of A, C and G.

Step 2

Double stranded cDNA is produced using 5'-con2-Nx, where con2
is an oligonucleotide of between 1 and 100 nucleotides, x is
between 1 and 10, and N is a mixture of A, C, T and G. The
ds-cDNA is synthesized by Klenow enzyme with the
above-described oligonucleotide.

Step 3

Pre-amplification of double stranded cDNA: to amplify the
double stranded cDNA, the cDNA is PCR amplified using a
combination of con1 and con2 primers.

Step 4

The pre-amplified cDNA is further amplified and separated
into 64 pools using a combination of a labeled con1 and 64
con2-NNN primers in a PCR amplification procedure, where NNN
are combined in 64 different ways using the nucleosides A, T,
G and C.

Step 5

Each of the 64 pools is separated using the PAGE
electrophoresis principle.

EXAMPLES 

In order to verify the functionality of the invention,
examples are described below in which a developmental
eukaryotic cellular system, pheochromocytoma PC12, was
employed.

Nerve Growth Factor (NGF) induces growth arrest and neurone
outgrowth in the in vitro PC12 cell system. Other growth
factors, such as epidermal growth factor (EGF), support
survival and stimulate growth. NGF-induced genes include the
immediate early genes, which encode transcription factors,
such as c-fos and c-myc. The products of the immediate early
genes are thought to be involved in regulating the expression
of genes associated with the neuronal phenotype.

In order to identify new early genes involved in neuronal
differentiation and proliferation, the following DODET method
is used.

In the following examples, it is demonstrated how efficiently
the method of the invention can be applied to such cellular
systems.

EXAMPLE 1 

The rat pheochromocytoma PC12 cells were grown (in vitro) in
the presence and absence of Nerve Growth Factor (NGF) and
epidermal growth factor (EGF) under growth conditions
described elsewhere (Saltiel et al. 1995).

The total RNA was isolated using the standard single-step
method by Chomczynski and Sacchi according to Sambrook et al.
1989.

Total RNA concentration was determined spectrophotometrically
and then adjusted to 0.2 µg/µl. This RNA was used directly in
the Northern analysis.

For DODET 4 x 0.5 µg total RNA was reverse transcribed in
separate pools using the primer 5'-T25AA-3'. The same
procedure was performed using the 5'-T25GC-3' poly-dT
anchored primer, giving a total of 2 x 4 x 0.5 µg of RNA.

First strand synthesis

20.0 µl   total RNA (amount between 0.3 and 1.0 µg RNA)
 3.0 µl   5'-T25AA-3' (conc. 100 ng/µl) or 5'-T25GC-3'
          (conc. 100 ng/µl)
          10 x cDNA buffer (buffer B from Epicentre
          Technologies # R19250)
 2.0 µl   dNTPs (25 mM from Pharmacia Biotech)
 1.0 µl   Superscript II RT (Gibco BRL # 18064-014)
 5.0 µl   Retrotherm RT (1 U/µl) (Epicentre Technologies
          # R19250)
14.0 µl   H2O

To obtain high specificity, the cDNA reaction was incubated
at 50°C for 30 minutes followed by 1 hour incubation at 70°C.



Second strand synthesis:

To the first strand reaction, add the following components:

15.0 µl   10 x cDNA buffer
 3.0 µl   Hybridase Thermostable RNase H (1 U/µl) (Epicentre
          Technologies)
 1.0 µl   rBst thermostable DNA polymerase (1 U/µl)
          (Epicentre Technologies # BH1100)
31.0 µl   H2O

Incubate at 65°C for 1 hour.



The resulting double stranded cDNA was phenol extracted,
precipitated and resuspended in 20 µl of H2O. Half of this
volume was checked on a gel; if a smear between 100 bp and 3000
bp was observed, the rest of the cDNA was used for DODET
template production. The resulting cDNAs were digested with
10 U of each of the thermostable restriction enzymes TaqI and
BclI at 50°C for 2 hours. To this mixture, DODET adapters
were added and ligated to the ends of the restriction fragments
with T4 DNA ligase (1 U), resulting in the primary template.
8-15 cycles of non-radioactive pre-amplification, using primers
complementary to the DODET adapters, were performed on a small
aliquot (1/10th volume) of the primary template (94°C
denaturation, 30 s; 56°C annealing, 30 s; 72°C polymerisation,
1 min). The products of the amplification (termed secondary
template) were also checked on a 1.5% agarose gel. As expected,
fragment sizes were predominantly between 100 bp and 1000 bp.
All amplification reactions were carried out on a PE-9600
thermocycler using Taq DNA polymerase, both from Perkin Elmer
Corp. (Norwalk, CT, USA). The final template was then diluted
10-fold with H2O.

The adapters ligated to the restriction fragments, and the
primers used for pre-amplification and active PCR, are given
below:

TaqI adapter:                    5'-CAGCATGAGTCCTGAC
                                        TACTCAGGACTGGC-5'

TaqI pre-amplification primer:   5'-CAGCATGAGTCCTGACCGA
TaqI amplification primer:       5'-CATGAGTCCTGACCGAN
(N = A or C or G or T)

BclI adapter:                    5'-CTCGTAGACTGCGTACC
                                          CTGACGCATGGCTAG-5'

BclI pre-amplification primer:   5'-CTCGTAGACTGCGTACCGATCA
BclI amplification primer:       5'-GACTGCGTACCGATCAN
(N = A or C or G or T)

For PCR, all the different combinations of one-base extensions
(denoted as N above) were available, giving a total of 4^2 (16)
primer combinations. All oligonucleotides were obtained from
DNA Technology (Aarhus, Denmark).
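
The primer combinations available with one selective base on each amplification primer can be enumerated with a short script. This is only an illustrative sketch: the two primer cores are copied from the list above, while the function and variable names are assumptions made for the example and are not part of the described method.

```python
from itertools import product

# Primer cores copied from the list above; the trailing N is the selective base.
TAQI_CORE = "CATGAGTCCTGACCGA"
BCLI_CORE = "GACTGCGTACCGATCA"
BASES = "ACGT"


def selective_primers(core, n_selective=1):
    """Return every primer formed by appending n_selective bases to core."""
    return [core + "".join(ext) for ext in product(BASES, repeat=n_selective)]


def primer_pairs(n_selective=1):
    """Return all (TaqI primer, BclI primer) combinations."""
    return list(product(selective_primers(TAQI_CORE, n_selective),
                        selective_primers(BCLI_CORE, n_selective)))


if __name__ == "__main__":
    pairs = primer_pairs()
    print(len(pairs))  # number of combinations with one selective base each
    for taqi, bcli in pairs[:3]:
        print(taqi, bcli)
```

With one selective base per primer the script lists 4 x 4 pairs; passing `n_selective=2` shows how quickly the number of pools grows if longer extensions are used.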

Radioactive labelling of the BclI primer was performed using
1 U of T4 polynucleotide kinase. Thermocycling was carried out
essentially as described above but with 35 cycles and including
an 11 cycle touch-down (the annealing temperature was reduced
from 65°C to 56°C in 0.7°C steps for 11 cycles and subsequently
maintained at 56°C for 23 cycles). Samples were then boiled
after the addition of dye and 50% formamide and separated on a
5% polyacrylamide sequencing type gel (GIBCO BRL Life
Technologies Inc., Gaithersburg, MD, USA). All gels were run at
standard conditions, such that the 70 bp marker was 3 cm from
the bottom of the gel, giving good resolution between 70 and
800 bp. Gels were then dried directly onto Whatman 3M paper on
a slab gel dryer. Labelled DNA fragments were visualised by
autoradiography. Gels and films were positionally marked prior
to development. The 1 base selective extensions were chosen
empirically to yield approximately 50 radioactively labelled
fragments per lane.
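
The touch-down profile can be written out cycle by cycle. The sketch below assumes one possible reading of the description: a first cycle at 65°C, then 11 cycles in which the annealing temperature drops by 0.7°C per cycle, then 23 cycles at 56°C, which adds up to 35 cycles. The text does not state how the first cycle is counted, so that reading is an assumption, as are the function and parameter names.

```python
def touchdown_schedule(start=65.0, step=0.7, touchdown_cycles=11,
                       final=56.0, final_cycles=23):
    """Return the annealing temperature (deg C) for each PCR cycle.

    Assumed reading: one cycle at `start`, then `touchdown_cycles` cycles,
    each `step` degrees lower than the one before, then `final_cycles`
    cycles at `final`.
    """
    temps = [start]
    for i in range(1, touchdown_cycles + 1):
        temps.append(round(start - i * step, 1))
    temps.extend([final] * final_cycles)
    return temps


if __name__ == "__main__":
    schedule = touchdown_schedule()
    print(len(schedule))   # total number of cycles
    print(schedule[:13])   # start cycle, touch-down cycles, first plateau cycle
```

Under this reading the last touch-down cycle anneals at 57.3°C before the plateau at 56°C; a different counting convention would shift the values by one step.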

Bands identified on the autoradiogram as interesting were
lined up with markings on the film and the dehydrated gel and
were excised. Excised fragments were monitored for activity.
The gel fragments were isolated using GENECLEAN (BIO101,
California, USA). DNA was then recovered according to the
manufacturer's recommendations. DNA fragments could then be
reamplified using the same PCR conditions and primers as used
in the initial PCR; however, 15 cycles generally yielded
sufficient product for cloning. Cloning was achieved using
unpurified PCR product and the vector display-pl23T (Display
Systems Biotech, USA). Conditions were used as recommended by
the manufacturer.



EXAMPLE 2 

Figure 3 shows a typical DODET gel produced by amplification
of template derived from treatment of PC12 cells with NGF or
EGF.

Total RNA was reverse transcribed with the 5'-T25AA-3' and
5'-T25GC-3' poly-dT anchored primers, and after anchor ligation,
pre-amplification with BclI and TaqI pre-amplification primer
pairs was performed.

6 out of 16 possible primer combinations are shown, using 1
selective base at each restriction enzyme site (Figure 3).
The largest visible products (Figure 3) are approximately
1000 bp in size and the lower end of the gel corresponds to
approximately 100 bp. In this size window an average of 50
bands can be scored for each primer combination. In Figure 3,
various expression patterns can be detected.

Due primarily to the stringent conditions possible in DODET,
resolution of the banding pattern is high while the level of
background remains acceptable (Figure 3). Furthermore, quite
radical changes in the intensity of individual bands over the
treatment period do not seem to affect the patterns of other
bands in the same lane.

It is, therefore, possible to conclude that the PCR remains
proportionally independent of the concentration of individual
substrates in the reaction.

The use of an optimised combination of the standard protocols
described above for isolating, re-amplifying and cloning
individual RDFs has allowed the identification of a number
of transcripts associated with differentiation and
proliferation events.

Four RDFs were isolated for further analysis (Figure 3, bands
a, b, c and d).

Sequence analysis revealed that RDF a = RDF b and RDF c = RDF
d, as illustrated in Figure 3.

In all cases appropriate terminal sequences with the correct
1 selective base extensions used in the PCR could be retrieved,
demonstrating the stringency and fidelity of the system
(data not shown).

EXAMPLE 3 

During scanning of the PC12 cellular systems treated with NGF
or EGF with different primer combinations, two RDFs (designated
RDF01 and RDF02) exhibiting differential expression during the
NGF treatment were isolated (RDF01 = RDF a = RDF b and
RDF02 = RDF c = RDF d in Figure 3).

After re-amplification, sub-cloning and DNA sequencing,
further DNA analysis revealed two unknown RDFs upregulated
after 60 minutes NGF treatment or 90 minutes EGF treatment in
the PC12 cellular system. The nucleotide sequences of both
RDF01 and RDF02 show less than 10% homology to any existing
gene in the GenBank or EMBL databases.

The expression of RDF01 and RDF02 was further analyzed using
Northern blot (Figure 4a - 4d). Here, transcripts could
clearly be detected at 60 minutes NGF treatment or 90 minutes
EGF treatment of the PC12 cells, confirming the results
obtained using the DODET method, as illustrated in Figure 3.

Experiments to clone the full length of RDF01 and RDF02, and
biological characterisation of their involvement in the
differentiation and proliferation of the PC12 cellular
system, are currently in progress.



EXAMPLE 4 



Searching for genes modulated by a growth factor.

A human cell line was treated with a growth factor and RNA
was isolated at various time points as indicated below.

1  Cell without any treatment (lanes 2, 7, 12, 17, 22)
2  Cell treated with helper agent, 1 day (lanes 3, 8, 13, 18, 23)
3  Cell treated with helper agent and growth factor, 1 day
   (lanes 4, 9, 14, 19, 24)
4  Cell treated with helper agent, 6 days (lanes 5, 10, 15, 20, 25)
5  Cell treated with helper agent and growth factor, 6 days
   (lanes 6, 11, 16, 21, 26)

Human RNA was isolated and the gene discovery analysis was
performed essentially as described in the legend to Fig. 3.
5 out of 64 amplification primers are shown in Fig. 5, each
covering a certain portion of the mRNA pool in the human cell
line. The expression analysis was performed on an ALFexpress,
an automated fragment analyzer from Pharmacia Biotech, using
a Cy5 label.

In lane 11 a downregulation is observed after 6 days of
treatment. In lane 16 an upregulation is observed after 6
days of treatment. Both modulations are due to the growth
factor, since regulation is only observed with the active
growth factor present.

EXAMPLE 5 

Searching for genes involved in bacterial resistance to
antibiotics.

A Listeria monocytogenes strain was treated with the
bacteriocin, Inosin. RNA from a strain resistant to Inosin was
further investigated.

1. Bacterial clone 1 without any treatment (lanes 2, 6, 10, 14, 18)
2. Bacterial clone 2 without any treatment (lanes 3, 7, 11, 15, 19)
3. Bacterial clone 3 resistant to Inosin (lanes 4, 8, 12, 16, 20)
4. Bacterial clone 4 resistant to Inosin (lanes 5, 9, 13, 17, 21)

Bacterial RNA was isolated by standard techniques and the
gene discovery analysis was performed according to Example 4
and Fig. 5, with the exception that a 5'-NN1NTNNWYYA primer was
used for first strand synthesis.

6 of 64 amplification primers are shown in Figure 6, each
covering a certain portion of the mRNA pool in the prokaryotic
cell system. The expression analysis was performed on an
ALFexpress, an automated fragment analyzer from Pharmacia
Biotech, using a Cy5 label.

In lanes 8 and 9 a downregulation is observed and in lanes 20
and 21 an upregulation is observed. Both modulated fragments
represent potential genes involved in the resistance to Inosin.

1. A method for preparing a normalized sub-divided library of
amplified cDNA fragments from the coding region of mRNA
contained in a sample, the method comprising the steps of

a) subjecting the mRNA derived from the sample to reverse
transcription using at least one cDNA primer having the
general formula

5'-Con1-dT(n2)-V(n3)-N(n4)-3'

wherein Con1 is any sequence between 1-100 nucleotides,
dT is deoxythymidinyl, V is A, G or C, N is A, G, C or T,
n2 is an integer > 1, n3 is 0 or 1, if n3 is 0 then n4 is
0, and if n3 is 1 then n4 is an integer ≥ 0, thereby obtaining
first strand cDNA fragments,

b) synthesizing second strand cDNA complementary to the
first strand cDNA fragments by use of the first strand
DNA fragments as templates, and a second cDNA primer with
the general formula

5'-Con2-N(x)-3'

wherein Con2 is any sequence between 1-100 nucleotides
and can be different from or identical to Con1, N(x) is A, G, T
or C, and x is an integer > 0, in an appropriate enzyme/buffer
solution which comprises the DNA pol I enzyme or
the Klenow fragment of the DNA pol I enzyme, all four
deoxyribonucleoside triphosphates and standard buffer and
temperature conditions, thereby obtaining double stranded
cDNA fragments, and

c) subjecting the cDNA fragments obtained in step b) to a
molecular amplification procedure so as to obtain amplified
cDNA fragments, wherein is used a set of amplification
primers having the general formula

5'-Con3-N(n1)-3'   (I)

wherein Con3 is a sequence identical to either Con1 or
Con2 or both, N is A, G, T or C, and n1 is an integer ≥
0, wherein at least one set of primers has the general
formula I where n1 > 0, said at least one set being capable
of priming amplification of any nucleotide sequence
complementary in its 5'-end to Con1 or Con2.

2. A method for preparing a normalized sub-divided library of
amplified cDNA fragments from the coding region of mRNA
contained in a sample, the method comprising the steps of

a) subjecting the mRNA derived from the sample to reverse
transcription using at least one cDNA primer, thereby
obtaining first strand cDNA fragments,

b) synthesizing second strand cDNA complementary to the
first strand cDNA fragments by use of the first strand
DNA fragments as templates, thereby obtaining double
stranded cDNA fragments,

c) digesting the double stranded cDNA fragments with at
least one restriction endonuclease, thereby obtaining
cleaved cDNA fragments,

d) ligating at least two adapter fragments to the cleaved
cDNA fragments obtained in step c), so as to obtain
ligated cDNA fragments, and

e) subjecting the ligated cDNA fragments obtained in step
d) to a molecular amplification procedure so as to obtain
amplified cDNA fragments, wherein is used, for an adapter
fragment used in step d), a set of amplification primers
having the general formula

5'-Com-N(n1)-3'   (II)

wherein Com is a sequence complementary to at least the
5'-end of an adapter fragment which is ligated to the 3'-end
of a cleaved cDNA fragment, N is A, G, T, or C, and
n1 is an integer ≥ 0, and wherein at least one set of
primers has the general formula I where n1 > 0, said at
least one set being capable of priming amplification of
any nucleotide sequence ligated in its 3'-end to the
adapter fragment complementary in its 5'-end to Com.

3. A method according to claim 1 or 2, wherein the mRNA is of
eukaryotic, archaeal or prokaryotic origin.

4. A method according to any of claims 1-3, wherein the
reverse transcription is performed under high stringency
conditions.

5. A method according to any of the preceding claims, wherein
the reverse transcription is carried out at a temperature in
the range from about 45°C to about 95°C by use of an enzyme
having reverse transcriptase activity at said temperature.

6. A method according to claim 5, wherein the enzyme is
thermostable, such as an enzyme selected from the group
consisting of a DNA polymerase with reverse transcriptase
activity derived from thermophilic eubacteria, such as Taq
(Thermus aquaticus), Stoffel (Thermus aquaticus), Tth (Thermus
thermophilus), Tfl/Tub (Thermus flavus), Tru (Thermus
Ruber), Tca (Thermus caldophilus), Tfil (Thermus filiformis),
Tbr (Thermus Brockianus), Bst (B. Stearothermophilus), Bca
(B. Caldotenax YT-G), Bcav (B. Caldovelox YT-F), FjSS3-B.1
(Thermotoga FjSS3-B.1), Tma (Thermus Maritima), UlTma (T.
Maritima), Tli (T. Litoralis), Tli exo- (T. Litoralis), 9°N-7
(Thermococcus sp.), BG-D (Pyrococcus sp.), Pfu (P. furiosus),
Pwo (P. woesei), Sac (S. Acidocaldarius), Ssol (S.
Solfataricus), Tac (T. Acidophilus), and Mth (Methanococcus
Voltae).



7. A method according to any of [...], wherein the
reverse transcription is carried out at a temperature in the
range from about 25°C to about 55°C by use of an enzyme
having reverse transcriptase activity at said temperature.

8. A method according to claim 7, wherein the enzyme is a
reverse transcriptase, such as a reverse transcriptase selected
from reverse transcriptase from AMV (Avian Myeloblastosis
Virus), M-MuLV (murine M-MuLV pol gene), or HIV-1 (HIV
virus).

9. A method according to any of claims 1-4, wherein the
reverse transcription is carried out in two subsequent steps,
the first step comprising carrying out reverse transcription
as defined in claim 7 or 8, and the second step comprising
carrying out reverse transcription as defined in claim 5 or
6.

10. A method according to claim 9, wherein reverse
transcription in the two steps is effected by non-identical
enzymes having reverse transcriptase activity.

11. A method according to claim 10, wherein the non-identical
enzymes are added separately in each step or are present in
both steps.

12. A method according to claim 11, wherein the activity of
the enzyme which is active in the first step is substantially
abolished in the second step.

13. A method according to any of claims 9-12, wherein the
enzyme effecting reverse transcription in the first step is
reverse transcriptase from M-MuLV, AMV or HIV-1 and/or the
enzyme effecting reverse transcription in the second step is
Tth or Taq.

14. A method according to any of the preceding claims, wherein
the at least one cDNA primer includes an oligo or poly dT
tail in the 3' end.

15. A method according to claim 14, wherein the at least one
cDNA primer has the general formula 5'-dT(n2)-V(n3)-N(n4)-3',
wherein dT is deoxythymidinyl, V is A, G, or C, N is A, G, C
or T, n2 is an integer ≥ 1, n3 is 0 or 1, if n3 is 0 then n4
is 0, and if n3 is 1 then n4 is an integer ≥ 0.

16. A method according to claim 15, wherein, when n3 is 1,
3 x 4^n4 groups of cDNA primers are used, each group being
distinct from any one of the other groups with respect to the
structure -V(n3)-N(n4)-.

17. A method according to claim 16, wherein the pool of mRNA
is subdivided into 3 x 4^n4 aliquots which are each subjected
separately to step a) utilising one of the 3 x 4^n4 groups of
cDNA primers, thereby obtaining a subdivision of the first
strand cDNA into 3 x 4^n4 separate pools.
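
The count of 3 x 4^n4 primer groups follows directly from the primer formula: with n3 = 1 there are three choices for V and four choices for each of the n4 positions N. The following sketch enumerates the primers for a given n2 and n4. The function name and the default of 25 thymidines (taken from the T25 primers of Example 1) are assumptions for illustration.

```python
from itertools import product


def anchored_primers(n2=25, n4=0):
    """Enumerate 5'-dT(n2)-V-N(n4)-3' primers for the case n3 = 1."""
    if n2 < 1 or n4 < 0:
        raise ValueError("n2 must be >= 1 and n4 must be >= 0")
    primers = []
    for v in "AGC":                               # V is A, G or C
        for tail in product("AGCT", repeat=n4):   # each N is A, G, C or T
            primers.append("T" * n2 + v + "".join(tail))
    return primers


if __name__ == "__main__":
    for n4 in (0, 1, 2):
        print(n4, len(anchored_primers(n4=n4)))   # group count is 3 * 4**n4
```

For n4 = 1 the list contains the two primers used in Example 1, T25AA and T25GC, among its 12 members.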

18. A method according to claim 16 or 17, wherein n4 is 0 or
1.

19. A method according to any of claims 1-13, wherein the at
least one cDNA primer does not include a poly or oligo dT
tail in the 5'-end, or wherein at least two cDNA primers are
used of which at least one includes a poly or oligo dT tail
in the 5'-end and of which at least one second does not
include a poly or oligo dT tail in the 5'-end.

20. A method according to claim 19, wherein the cDNA primer
which does not include a poly or oligo dT tail in the 5'-end
has the following structure

5'-N(x)TTA-3' or 5'-N(x)CTA-3' or 5'-N(x)TCA-3',

wherein N is A, G, T, or C, and x is an integer 1 < x < 20.



21. A method according to any of the preceding claims, wherein
step b) is carried out under conditions which minimize the
formation of mismatches between nucleotides in the first and
second cDNA strands.

22. A method according to any of the preceding claims, wherein
the at least one restriction enzyme is chosen so as to
ensure that at least 60% of cDNA's are cleaved.

23. A method according to any of the preceding claims
comprising the use of at least one restriction endonuclease
which upon cleavage of cDNA results in cleaved cDNA fragments
having sticky ends.

24. A method according to any of the preceding claims, wherein
the at least one restriction enzyme is chosen so as to
cleave each complete cDNA into an average of about 3
fragments.

25. A method according to any of the preceding claims,
comprising the use of a rare 4 base cutter as at least one
restriction endonuclease, such as the 4 base cutter AciI,
AluI, BfaI, BstUI, Csp6I, DpnI, DpnII, HaeIII, HhaI, HinP1I,
HpaII, MboI, MnlI, MseI, MspI, NlaIII, RsaI, Sau3AI, TaiI,
TaqI, and Tsp509I.
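
As a rough guide to how often a 4 base cutter cleaves a cDNA, the expected number of fragments can be estimated under a very simple model. The sketch assumes a random sequence in which all four bases are equally frequent and a fully specified recognition site, so that a given 4 bp site occurs on average once every 4^4 = 256 positions. Real cDNA deviates from both assumptions, and the function name is hypothetical.

```python
def expected_fragments(cdna_length, site_length=4):
    """Expected fragment count for a linear molecule (uniform random model)."""
    if cdna_length < site_length:
        return 1.0  # too short to contain the site, so it stays in one piece
    positions = cdna_length - site_length + 1      # possible site start positions
    expected_sites = positions / 4 ** site_length  # each matches with p = 4**-k
    return expected_sites + 1.0                    # n cuts give n + 1 fragments


if __name__ == "__main__":
    for length in (515, 1000, 2000):
        print(length, round(expected_fragments(length), 2))
```

Under this model a cDNA of roughly 500 bp is cut into about 3 fragments by a single 4 base cutter, which gives a feel for the fragment numbers discussed in the claims above.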

26. A method according to any of the preceding claims, wherein
one restriction enzyme is used.

27. A method according to any of claims 1-21, which comprises
the use of a first restriction enzyme which statistically
cleaves at least 20% of complete cDNA derived from the mRNA
sample into two subfragments, and of a second restriction
enzyme which statistically cleaves at least 50% of said
subfragments into 3 further subfragments.

28. A method according to any of the preceding claims, wherein,
in step d), at least one termination fragment is also
ligated to the 3'-end of single strands of cleaved cDNA
fragments, said at least one termination fragment introducing a
block against DNA polymerization in the 5'→3' direction
setting out from the at least one termination fragment and
said at least one termination fragment being unable to anneal
to any primer of the at least two primer sets in step e)
during the molecular amplification procedure.

29. A method according to claim 28, wherein the at least one
termination fragment comprises or is a chemically modified
nucleotide sequence.

30. A method according to claim 29, wherein the chemically
modified nucleotide sequence comprises a dideoxynucleotide in
the 3'-end.

31. A method according to claim 30, wherein the
dideoxynucleotide is covalently attached to the nucleotide
strand.

32. A method according to any of the preceding claims, wherein
the ligation of adapter and/or termination fragments to
the cleaved cDNA fragments in step d) is achieved by annealing
the adapter fragments to sticky ends of the cDNA resulting
from the cleavage in step c) and subjecting the product
to the action of an enzyme having DNA ligase activity.

33. A method according to any of the preceding claims, wherein
the at least one set of amplification primers of formula I
or II wherein n1 is > 0 has n1 = 1, n1 = 2, n1 = 3, or n1 = 4.

34. A method according to any of the preceding claims, wherein
n1 = 0 in the at least one set of amplification primers
having formula I or II.

35. A method according to claim 34, wherein the set of
amplification primers having n1 = 0 in formula I or II is
labelled.

36. A method according to any of the preceding claims, wherein
the set of amplification primers having formula I or II
wherein n1 > 0 comprises all possible combinations and
permutations of A, G, T and C in the group N(n1).

37. A method according to any of the preceding claims, where- 
in the ligated cDNA fragments are sub-divided into a number 
of pools prior to the molecular amplification in step e) , 
each pool being subjected to the amplification using a subset 
of the set of amplification primers. 

38. A method according to claim 36, wherein the subset of
amplification primers used for each pool comprises a primer
as defined in claim 35.

39. A method according to any of the preceding claims, which 
comprises the use of one amplification primer as defined in 
claim 35, and of one set of primers as defined in claim 36. 

40. A method according to claim 39, wherein the ligated cDNA
fragments of step d) are subdivided into 4^n1 pools which are
each subjected separately to step e) wherein is used one
amplification primer as defined in claim 35 and one primer
from the set of amplification primers as defined in claim 36,
said one primer being distinct from any one of the primers
used for amplifying any of the other pools.

41. A method according to any of the preceding claims, which 
comprises the further step of separating amplified fragments 
obtained from the molecular amplification procedure. 

42. A method according to claim 41, wherein the separation is 
performed by gel electrophoresis or chromatography. 

43. A method according to claim 41 or 42, which further
comprises the step of identifying separated amplified
fragments.

44. A method according to claim 43, wherein the identification
is performed by visualization.

45. A method according to claim 44, wherein labelled
nucleotides are visualized, the labelled nucleotides being part
of a probe or of the amplified fragments.

46. A method according to claim 45, wherein the labelled
nucleotides are the labelled nucleotides being part of the
labelled primers as defined in claim 35.

47. A method according to claim 46, wherein the visualization
is performed by incorporating radioactive or fluorescent
alpha dNTP into the cDNA fragment during PCR, where N = A, C,
T, U or G.

48. A method for determining the presence of an expression
product in a cell or group of cells, the method comprising
providing an RNA-containing sample from the cell or group of
cells and subjecting the sample to the method according to
any of claims 1-47, and thereafter performing a comparison of
the thus identified amplified cDNA fragments with a database
output, said database output comprising a computer-generated
list of molecular weights of restriction DNA fragments of
known sequences, said list being prepared by

inputting and storing DNA sequence data in a database as
virtual DNA sequences,

subsequently simulating cleavage of the virtual DNA
sequences with the at least one restriction nuclease and
storing the resulting simulated cleavage products as
virtually cleaved DNA fragments,

simulating ligation to the virtually cleaved fragments of
the at least two adapter fragments and storing the results
as virtually ligated DNA fragments,

for each individual combination of primers used in step
e), grouping the virtually ligated DNA fragments susceptible
to amplification by said combination of primers in
the same group,

determining, in each group, the absolute and/or relative
molecular weight of each virtually ligated DNA fragment,
and

outputting the content of each group in the form of a
list comprising the absolute and/or relative molecular
weights of the virtually ligated fragments in the group.
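
The list-building steps of this claim can be illustrated with a small in-silico digest. The sketch below is not the database software itself; it is a minimal illustration under several assumptions: the two enzymes are TaqI (T^CGA) and BclI (T^GATCA) as in Example 1, digestion is complete, only fragments with one TaqI end and one BclI end are amplifiable, one selective base is used per primer, and product length in bp stands in for molecular weight. The adapter-derived lengths (13 and 11 bases) are counted from the amplification primers of Example 1. All function and variable names are hypothetical, and input sequences are assumed to contain only A, C, G and T.

```python
from collections import defaultdict

SITES = {"TaqI": "TCGA", "BclI": "TGATCA"}   # both enzymes cut after the first T
# Primer bases contributed by the adapter rather than by the cDNA
# (amplification primer length minus restriction-site remnant, no N).
ADAPTER_BASES = {"TaqI": 13, "BclI": 11}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def find_sites(seq):
    """Return sorted (start, enzyme) tuples for every recognition site."""
    hits = []
    for enzyme, site in SITES.items():
        start = seq.find(site)
        while start != -1:
            hits.append((start, enzyme))
            start = seq.find(site, start + 1)
    return sorted(hits)


def virtual_fragments(name, seq):
    """Yield ((TaqI base, BclI base), product length, name) per fragment."""
    hits = find_sites(seq)
    for (l_start, l_enz), (r_start, r_enz) in zip(hits, hits[1:]):
        if l_enz == r_enz:
            continue  # amplification needs one primer of each kind
        # Selective base on the left: first base after the left site.
        left_base = seq[l_start + len(SITES[l_enz])]
        # Selective base on the right is read from the opposite strand.
        right_base = COMPLEMENT[seq[r_start - 1]]
        # Digested fragment including both overhangs.
        insert = (r_start + len(SITES[r_enz]) - 1) - (l_start + 1)
        length = insert + ADAPTER_BASES[l_enz] + ADAPTER_BASES[r_enz]
        selective = {l_enz: left_base, r_enz: right_base}
        yield (selective["TaqI"], selective["BclI"]), length, name


def build_table(sequences):
    """Group virtual products by primer combination, sorted by length."""
    groups = defaultdict(list)
    for name, seq in sequences.items():
        for combo, length, origin in virtual_fragments(name, seq.upper()):
            groups[combo].append((length, origin))
    return {combo: sorted(items) for combo, items in groups.items()}


if __name__ == "__main__":
    demo = {"demo": "AAAATCGAGCCCCCTTGATCAAAAA"}
    for combo, items in build_table(demo).items():
        print(combo, items)
```

Each key of the resulting table corresponds to one primer combination, and each entry lists the predicted product lengths with their sequence of origin, which is the form in which observed band sizes can be compared against known sequences.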

49. A method according to claim 48, wherein the input DNA
sequence data are linked to data relating to the genetic
origin of the DNA sequence data and optionally to data relating
to functional features relating to the genetic origin.

50. A method according to claim 48, wherein the output
indication further comprises information about the genetic
origin of the virtually ligated DNA fragment and optionally
information about functional features associated with the
genetic origin.

51. A method according to any of claims 38-50, wherein the 
comparison is performed by inputting the identified amplified 
cDNA fragments in a format which allows automated comparison 
with the database output, or, alternatively, by outputting 
the database output in a format which allows for direct 
comparison between the separated amplified cDNA fragments and 
the database output. 

52. A method for determining change in expression, compared
to the expression in a reference cell or reference group of
cells, of an expression product in a cell or group of cells
which has been subjected to a first set of conditions
influencing the expression pattern of said cell or group of
cells, said reference cell or group of cells being subjected
to a second set of conditions, the method comprising
providing an RNA-containing sample from the cell or group of
cells and subjecting the sample to the method according to
any of claims 43-47 thereby obtaining data describing the
amplified cDNA fragments derived from the sample, providing
reference data describing amplified cDNA fragments derived
from an RNA-containing reference sample from the reference
cell or reference group of cells, the reference data being
obtained by having previously subjected the reference sample
to the method according to any of claims 43-47,
subsequently performing a comparison of the data and the
reference data to identify those cDNA fragments which are
expressed at different levels in the two data sets, and
thereafter using the differentially expressed cDNA fragments
to determine which expression products are subject to a
change in expression level.
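
The comparison step of this claim can be sketched as a simple lookup between two fragment profiles. The representation below (a dictionary mapping fragment length in bp to a band signal), the exact matching on length and the two-fold threshold are all assumptions chosen for illustration; the text does not prescribe a data format or a cut-off.

```python
def differential_fragments(sample, reference, min_fold=2.0):
    """Compare two fragment profiles (dict: fragment length in bp -> signal).

    Returns a dict mapping fragment length to a call for every fragment
    whose signal differs by at least `min_fold` or is present in one
    profile only.
    """
    calls = {}
    for size in sorted(set(sample) | set(reference)):
        s = sample.get(size, 0.0)
        r = reference.get(size, 0.0)
        if s > 0 and r == 0:
            calls[size] = "only in sample"
        elif r > 0 and s == 0:
            calls[size] = "only in reference"
        elif s > 0 and r > 0 and s / r >= min_fold:
            calls[size] = "up"
        elif s > 0 and r > 0 and r / s >= min_fold:
            calls[size] = "down"
    return calls


if __name__ == "__main__":
    treated = {120: 10.0, 250: 40.0, 400: 5.0, 610: 8.0}
    untreated = {120: 11.0, 250: 10.0, 400: 20.0, 700: 3.0}
    for size, call in differential_fragments(treated, untreated).items():
        print(size, call)
```

In practice band sizes from separate runs would need a tolerance window rather than exact matching, and signals would need normalisation between lanes; both are left out to keep the sketch short.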

53. A method according to claim 52, wherein the data and
reference data are selected from the group consisting of the
apparent molecular weights of the amplified DNA fragments,
the Mr of the amplified DNA fragments, the absolute amount of
the amplified DNA fragments, and the relative amounts of the
amplified DNA fragments.

54. A method according to claim 52, wherein the reference 
data are extracted from a database containing the reference 
data defined in claim 53 and optionally further information 
relating to the genetic origin of each amplified cDNA frag- 
ment from the reference. 

55. A method for diagnosing a disease in a subject, said
disease being characterized by a deviating expression level
of at least one expression product in at least one cell type,
the method comprising providing an RNA-containing sample
derived from the at least one cell type, subjecting the
sample to the method according to any of claims 43-47 thereby
obtaining data describing the amplified cDNA fragments derived
from the sample, providing reference data describing
amplified cDNA fragments derived from an RNA-containing
reference sample derived from the same type of cell from a
subject not suffering from the disease, the reference data being
obtained by having previously subjected the reference sample
to the method according to any of claims 43-47, and
subsequently performing a comparison of the data and the
reference data with respect to those cDNA fragments which are
known to be related to the disease, and assessing whether a
significant difference in the data and reference data exists
so as to establish whether the expression level of the
expression product deviates or not.

56. A method according to claim 55, wherein the data and
reference data are selected from the group consisting of the
apparent molecular weights of the amplified DNA fragments,
the Mr of the amplified DNA fragments, the absolute amount of
the amplified DNA fragments, and the relative amounts of the
amplified DNA fragments.

57. A method according to claim 55, wherein the reference
data are extracted from a database containing the reference
data defined in claim 56 and optionally further information
relating to the genetic origin of each amplified cDNA
fragment from the reference.

58. A method of synthesizing first strand cDNA, the method
comprising subjecting a sample comprising mRNA to reverse
transcription wherein, in a first step performed at a
temperature not exceeding 55°C, a first enzyme is used having a
substantial reverse transcriptase activity at said temperature
not exceeding 55°C, and, in a subsequent second step
performed at an elevated temperature in the range of 45°C -
95°C, a second enzyme is used having a substantial reverse
transcriptase activity at said elevated temperature, said
first enzyme being substantially inactive.

59. A method according to claim 58, wherein both enzymes are
present in both steps.

60. A method according to claim 58, wherein said first enzyme
has a substantially higher activity than said second enzyme
in catalyzing reverse transcription at said temperature not
exceeding 55°C.

61. A method according to claim 59 or 60, wherein said second
enzyme has a substantially higher activity than said first
enzyme in catalyzing reverse transcription at said elevated
temperature.

62. A method according to any of claims 58-61, wherein said
first enzyme is selected from the group consisting of
non-thermostable reverse transcriptases and wherein said second
enzyme is selected from the group consisting of thermostable
DNA polymerases with reverse transcriptase activities.

63. A composition for use in reverse transcription of RNA, 
the composition comprising 

a) a first enzyme having reverse transcriptase activity 
at temperatures not exceeding 55°C 

b) a second enzyme having reverse transcriptase activity 
at elevated temperatures in the range of 45°C - 95°C, 

said second enzyme having a substantially higher activity 
than said first enzyme in catalyzing reverse transcription at 
said elevated temperatures. 

64. A composition according to claim 63, wherein said first
enzyme has a substantially higher activity than said second
enzyme in catalyzing reverse transcription at said
temperatures not exceeding 55°C.

65. Use of a thermostable enzyme having reverse transcriptase
activity in the preparation of a composition for use in
reverse transcription of RNA which has previously been in
vitro reverse transcribed by another enzyme having reverse
transcriptase activity.

66. A method of preparing a surface (chip) coated with cDNA 
fragments, the method comprising 

RECTIFIED SHEET (RULE 91) 
ISA/EP 
subjecting an RNA-containing sample to the method of any 
of claims 41-47, and 

transferring the separated amplified cDNA fragments to a 
chip surface adapted to stably bind the separated amplified 
cDNA fragments while maintaining the spatial relative 
distribution pattern thereof. 

67. A method of preparing a surface (chip) coated with cDNA 
fragments, the method comprising 

subjecting an RNA-containing sample to the method of any 
of claims 1-40, 

separating, by electrophoresis, the thus obtained amplified 
cDNA fragments on a particular surface adapted to 
stably bind the separated amplified cDNA fragments while 
maintaining the relative distribution pattern after 
electrophoresis. 

68. A method according to claim 67, wherein the electrophore- 
sis is in the form of microelectrophoresis. 

69. A method according to any of claims 66-68, wherein the 
transfer is accomplished by an electrophoretic blotting 
technique. 

70. A method according to any of claims 66-68, wherein the 
transfer is accomplished by photo-activated organic or inor- 
ganic chemistry techniques. 

71. A surface having cDNA stably bound thereto, said surface 
being obtainable by the method according to any of claims 
66-70. 

72. A method for the screening for genes within a family of 
genes, the method comprising providing a surface according to 
claim 71, wherein cDNA stably bound to the surface is 
hybridized [illegible] to a detectably labelled 
nucleic acid which is a representative of a gene family, 
and subsequently analyzing fragments of the chip to which 
hybridization has occurred so as to determine whether such 
fragments are related to the same gene family. 

73. A method for determining the difference in expression 
pattern between a first cell or type of cells and a second 
cell or type of cells, the method comprising providing samples 
of labelled RNA or cDNA from the first and second cells 
or cell types and subsequently contacting each of these 
samples with a surface according to claim 71, and subsequently 
detecting the amount and distribution of bound labelled 
RNA or cDNA from each sample. 

74. A method for screening for interactions between a pre- 
selected protein and a polypeptide fragment, the method 
comprising preparing a sub-divided library of amplified cDNA 
fragments according to the method of any of claims 1-47, 
optionally adapting the terminals of the members of the 
library so as to facilitate insertion into a vector, inserting 
the fragments into vectors, transforming a population of 
suitable host cells with the vectors, culturing the host 
cells under conditions which enable expression of correctly 
inserted cDNA fragments by the host cell, and subsequently 
assaying polypeptide fragments encoded by the inserted cDNA 
fragments for interaction with the pre-selected protein. 

75. A method according to claim 74, wherein assaying of the 
polypeptide fragments is performed by a two-hybrid technique, 
wherein the host cells are eukaryotic cells which are mated 
or transfected with nucleic acid material encoding the pre- 
selected protein, successful mating/transfection of the 
cell(s) resulting in a cell or cells wherein the interaction 
between the pre-selected protein and a polypeptide fragment 
gives rise to a detectable signal. 

76. A method according to claim 75, wherein a fungus, such as 
a yeast cell, is used in the mating/transfection. 

77. A method according to claim 75 or 76, wherein the detectable 
signal is provided by Green Fluorescent Protein. 
Fig. 9 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-57,66-77 

A method for preparing a normalized sub-divided library of 
amplified cDNA fragments from the coding region of mRNA 
contained in a sample, the method comprising the steps of 
a) subjecting the mRNA derived from the sample to reverse 
transcription using at least one cDNA primer, b) 
synthesizing second strand cDNA complementary to the first 
strand cDNA fragments by use of the first strand DNA 
fragments as templates, and a second primer and c) 
subjecting the cDNA fragments obtained in step b) to a 
molecular amplification procedure so as to obtain amplified 
cDNA fragments, wherein is used a set of amplification 
primers; a method for preparing a normalized sub-divided 
library of amplified cDNA fragments from the coding region 
of mRNA contained in a sample, the method comprising the 
steps of a) subjecting the mRNA derived from the sample to 
reverse transcription using at least one cDNA primer, b) 
synthesizing second strand cDNA complementary to the first 
strand cDNA fragments by use of the first strand DNA 
fragments as templates, thereby obtaining double stranded 
cDNA fragments, c) digesting the double stranded cDNA 
fragments with at least one restriction endonuclease, 
thereby obtaining cleaved cDNA fragments, d) ligating at 
least two adapter fragments to the cleaved cDNA fragments 
obtained in step c), so as to obtain ligated cDNA fragments, 
and e) subjecting the ligated cDNA fragments in step d) to a 
molecular amplification procedure so as to obtain amplified 
cDNA fragments, wherein is used, for an adapter fragment 
used in step d), a set of amplification primers; a method 
for determining the presence of an expression product in a 
cell or a group of cells using said method; a method for 
determining change in expression, compared to the expression 
in a reference cell or reference group of cells; a method of 
diagnosing a disease in a subject; a method of preparing a 
surface (chip) coated with cDNA fragments; a method for 
screening for genes within a family of genes; a method for 
screening for interactions between a pre-selected protein 
and a polypeptide fragment; said method wherein assaying of 
the polypeptide fragments is performed by a two-hybrid 
technique; 



2. Claims: 58-65 

A method of synthesizing first strand cDNA, the method 
comprising subjecting a sample comprising mRNA to reverse 
transcription wherein, in a first step performed at a 
temperature not exceeding 55°C, a first enzyme is used 
having a substantial reverse transcriptase activity at a 
temperature not exceeding 55°C, and, in a subsequent second 
step performed at an elevated temperature in the range of 
45°C-95°C, a second enzyme is used having a substantial 
reverse transcriptase activity at said elevated temperature, 
said first enzyme being substantially inactive; a composition 
for use in reverse transcription of RNA, the composition 
comprising a) a first enzyme having reverse transcriptase 
activity at temperatures not exceeding 55°C b) a second 
enzyme having reverse transcriptase activity at elevated 
temperatures in the range of 45°C - 95°C, said first enzyme 
has substantially higher activity than said second enzyme in 
catalyzing reverse transcription at said temperature not 
exceeding 55°C; use of thermostable enzyme having reverse 
transcriptase activity in the preparation of a composition 
for use in reverse transcription of RNA which has previously 
been in vitro reverse transcribed by another enzyme having 
reverse transcriptase activity; 
A METHOD TO CLONE mRNAs AND DISPLAY OF DIFFERENTIALLY EXPRESSED TRANSCRIPTS (DODET) 



BACKGROUND OF THE INVENTION 



The human body is comprised primarily of specialised cells 
performing different physiological functions organised into 
organs and tissues. All human cells contain DNA, arranged in 
a series of sub-units known as genes. It is estimated that 
there are approximately 100,000 genes in the human genome. 
Genes are the blueprints for proteins. Proteins may perform a 
wide variety of biological functions, for example messengers, 
catalysts and sensors. Such compounds are responsible for 
managing most of the physiological and biochemical functions 
in humans and all other living organisms. Over the last few 
decades, there has been a growing recognition that many major 
diseases have a genetic basis. It is now well established 
that genes play an important role in cancer, cardiovascular 
diseases, psychiatric disorders, obesity, and metabolic dis- 
eases. Significant resources are being focused on genomic 
research based on the notion that the nucleotide sequences of 
a particular gene and its predicted protein product will lead 
to an understanding of its function in healthy and malfunc- 
tioning cells or tissues. This understanding is expected, in 
turn, to lead to therapeutic and diagnostic approaches, 
focused on molecular targets associated with the gene and the 
protein it expresses. The first step on the way to the deve- 
lopment of such applications is to identify the genes speci- 
fically involved in the different categories of diseases. 
Application of this knowledge can produce new and valuable 
markers, identifying regions producing major diseases to be 
used for diagnostic and therapeutic benefit. 



Faced with the high complexity of the human genome, many 
approaches are being used to unravel the connection between 
primary gene structure and function. One well publicised 
approach is embodied in the Human Genome Mapping Project, 
where the sequence of all the individual genes in the entire 
human genome is painstakingly being determined. At present, 
however, little information can be directly retrieved 
on the function of the identified genes and still less about 
temporal and spatial expression patterns of the developing or 
mature organism. Other approaches, such as random cDNA se- 
quencing, involve the sequence determination of all genes 
expressed in a certain tissue, or developmental stage, of an 
organism. Like a number of other strategies, this is time 
consuming and prone to numerous problems. 

Although the flood of data from large scale sequencing pro- 
grammes is of enormous benefit to the scientific community, 
one of the major problems faced by such "shotgun" approaches 
is the lack of specific information that can be retrieved 
without significantly more work on the biology of each of the 
individual genes. 

Several other approaches have been taken by molecular biolo- 
gists to obtain more specific information on the genetic 
background of particular biological processes. Such approa- 
ches rely on a common concept. One gene, or a subset of 
genes, is switched on, initiating the healthy, pathological, 
or developmental status of an organ or cell type. 

In a large number of experimental systems the isolation of 
genes, on the basis of their differential expression, has 
been applied successfully. Differential screening and sub- 
tractive hybridisation of cDNA libraries have become well 
established, cf. Zimmerman et al. (1980) and Davis et al. 
(1979). Differential library screening works well in practice 
for genes that are highly expressed, but mRNAs of low 
abundance are difficult to isolate. Subtractive hybridisation 
provides a more sensitive screening, but requires large 
amounts of RNA. More recently RNA fingerprinting methods 
(often referred to as differential display or DD/RT PCR) have 
been added to these tools, offering attractive new features 
for isolating genes. RNA fingerprinting methods are PCR based 
and therefore do not require large amounts of RNA for expe- 
riments. In addition to this, RNA fingerprinting methods 
allow a large number of RNA pools to be screened for specific 
mRNAs simultaneously. Investigation of a wide range of 
pathogenic developmental stages and their controls would be 
possible. To date, two methods of RNA fingerprinting have proven 
useful for isolating genes. In 1992 Liang et al. published a 
protocol (US Patent 5,262,311); soon after, a protocol from 
Welsh et al. (1992) was presented. Both methods begin with 
cDNA synthesis from RNA using at least one arbitrary primer 
for the initiation of first and second strand synthesis. 

Welsh et al. (1992) designed a protocol in which the same 
arbitrary 20-mer oligo is used for first and second strand 
synthesis. Using arbitrary primers only a subset of the mRNAs 
are transcribed to cDNA. The cDNA pools are then used for a 
standard PCR with the same primers. One of the dNTPs in the 
PCR mix contains a radioactive label (35S or 32P) for 
visualisation of the PCR fragments with PAGE. The Liang and Welsh 
methods rely on at least one small arbitrary primer for 
selection of specific cDNAs. As a consequence annealing 
temperatures are low (~40°C), and all amplified cDNA fragments 
originate from a certain degree of mismatch priming. 
Later several groups produced refinements and optimisations 
leading to a plethora of articles describing the usefulness 
of the method (Bauer and Warthoe et al. 1993; Warthoe et al. 
1995; Liang and Warthoe et al. 1995; Rohde and Warthoe et al. 
1996). 

OBJECT OF THE INVENTION 

It is an object of the present invention to provide new 
methods and means for investigating the expression patterns 
in cells, especially in eukaryotic cells. The results of such 
investigations may be used in drug development, gene discove- 
ry, diagnosis of diseases etc., and therefore such improved 
methods are highly desirable. 
SUMMARY OF THE INVENTION 

In its broadest scope, the invention pertains to a method for 
preparing a sub-divided library of amplified cDNA fragments 
from the coding region of mRNA contained in a sample, the 
method comprising the steps of 

a) subjecting the mRNA derived from the sample to reverse 
transcription using at least one cDNA primer having the 
general formula 

5'-Con_1-dT_{n2}-V_{n3}-N_{n4}-3' 

wherein Con_1 is any sequence of 1-100 nucleotides, 
dT is deoxythymidinyl, V is A, G or C, N is A, G, C or T, 
n2 is an integer > 1, n3 is 0 or 1, if n3 is 0 then n4 is 
0, and if n3 is 1 then n4 is an integer ≥ 0, thereby obtaining 
first strand cDNA fragments, 

b) synthesizing second strand cDNA complementary to the 
first strand cDNA fragments by use of the first strand 
DNA fragments as templates, and a second cDNA primer with 
the general formula 

5'-Con_2-N_x-3' 

wherein Con_2 is any sequence of 1-100 nucleotides 
and can be different from or identical to Con_1, N_x is A, G, T 
or C, and x is an integer ≥ 0, in an appropriate enzyme/buffer 
solution which comprises the DNA pol I enzyme or 
the Klenow fragment of the DNA pol I enzyme, all four 
deoxyribonucleoside triphosphates and standard buffer and 
temperature conditions, thereby obtaining double stranded 
cDNA fragments, 

c) subjecting the cDNA fragments obtained in step b) to a 
molecular amplification procedure so as to obtain amplified 
cDNA fragments, wherein is used a set of amplification 
primers having the general formula 

5'-Con_3-N_{n1}-3'    I 

wherein Con_3 is a sequence identical to either Con_1 or 
Con_2 or both, N is A, G, T or C, and n1 is an integer ≥ 
0, wherein at least one set of primers has the general 
formula I where n1 > 0, said at least one set being capable 
of priming amplification of any nucleotide sequence 
complementary in its 5'-end to Con_1 or Con_2. 

This method is advantageous for amplifying very small amounts 
of RNA. Using the method of the invention it is possible to 
perform gene-profile analysis from less than 100 cells, equal 
to 10^-9 gram total RNA (10 picogram RNA per cell). 
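
The figure quoted above follows from simple unit arithmetic, which can be checked with a short sketch. The only inputs are the values stated in the text (100 cells, 10 picogram total RNA per cell); the variable names are illustrative.

```python
# Values quoted in the text; names are illustrative only.
RNA_PER_CELL_PG = 10   # picograms of total RNA per cell
N_CELLS = 100          # upper bound on the number of cells

total_pg = RNA_PER_CELL_PG * N_CELLS   # total RNA in picograms
total_g = total_pg * 1e-12             # 1 picogram = 1e-12 gram

print(f"{total_pg} pg = {total_g:.0e} g")
```

In other words, 100 cells at 10 picogram each correspond to 1 nanogram of total RNA.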

In a further aspect, the invention relates to a method for 
preparing a sub-divided library of amplified cDNA fragments 
from the coding region of mRNA (which may be of prokaryotic, 
archaeal or eukaryotic origin) contained in a sample, the 
method comprising the steps of 

a) subjecting the mRNA derived from the sample to rever- 
se transcription using at least one cDNA primer, thereby 
obtaining first strand cDNA fragments, 

b) synthesizing second strand cDNA complementary to the 
first strand cDNA fragments by use of the first strand 
DNA fragments as templates, thereby obtaining double 
stranded cDNA fragments, 



c) digesting the double stranded cDNA fragments with at 
least one restriction endonuclease, thereby obtaining 
cleaved cDNA fragments, 

d) ligating at least two adapter fragments to the cleaved 
cDNA fragments obtained in step c), so as to obtain 
ligated cDNA fragments, and 

e) subjecting the ligated cDNA fragments obtained in 
step d) to a molecular amplification procedure so as to 
obtain amplified cDNA fragments, wherein is used, for an 
adapter fragment used in step d), a set of amplification 
primers having the general formula 

5'-Com-N_{n1}-3'    II 

wherein Com is a sequence complementary to at least the 
5'-end of an adapter fragment which is ligated to the 3'-end 
of a cleaved cDNA fragment, N is A, G, T, or C, and 
n1 is an integer ≥ 0, and wherein at least one set of 
primers has the general formula II where n1 > 0, said at 
least one set being capable of priming amplification of 
any nucleotide sequence ligated in its 3'-end to the 
adapter fragment complementary in its 5'-end to Com. 
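
As an illustration only, the primer set of general formula II can be enumerated programmatically. The sketch below assumes a hypothetical core sequence `COM` (invented for this example, not taken from the present document) and shows that n1 selective bases give 4^n1 primers, each of which selects one pool of amplified fragments. The helper names `selective_primers` and `pool_of` are likewise assumptions.

```python
from itertools import product

def selective_primers(core, n1):
    """All primers 5'-core-N_{n1}-3' for a given extension length n1."""
    return [core + "".join(ext) for ext in product("ACGT", repeat=n1)]

def pool_of(fragment, core, n1):
    """Return the n1-base extension that selects this strand, or None."""
    if not fragment.startswith(core) or len(fragment) < len(core) + n1:
        return None
    return fragment[len(core):len(core) + n1]

COM = "GACTGCGTACC"  # hypothetical core sequence, for illustration only
primers = selective_primers(COM, 3)

print(len(primers))                        # 4**3 primers
print(pool_of(COM + "AGCTTGACA", COM, 3))  # extension defining the pool
```

With n1 = 3 the set contains 64 primers, and a strand beginning 5'-Com-AGC- falls into the pool amplified by the primer carrying the AGC extension.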

The overall advantage of the invention compared to the prior 
art is that the resulting library of cDNA fragments contains 
nucleic acid sequences from all parts of cDNA which is 
produced in step a). Prior art techniques which i.a. rely on 
poly-dT cDNA priming have a tendency to only yield fragments 
derived from the long untranslated regions of mRNA. 
Furthermore, by fine-tuning of the conditions in each step, the 
method of the present invention results in highly specific 
reproduction of sequence information which is present in 
mRNA, even in mRNA which is only present in relatively low 
amounts. Furthermore, by choosing the optimum composition of 
endonuclease(s) it is possible to obtain cDNA fragments which 
are derived from a very large percentage of the total number 
of transcribed genes in relevant cells. 

The present method allows the targeted visualisation of known 
genes by using primer combinations corresponding to sequences 
from the gene of interest. This has the advantage that 
all steps of the procedure and the biological system can 
easily be verified. Also, very specific expression analyses 
can be carried out on related genes with very high homology 
which could not be achieved by using hybridisation technology. 

Briefly, further steps in the method of the invention involve 
isolation of bands of interest from a gel, their cloning and 
sequencing. The sequence information allows re-amplification 
of individual bands, using primers with the appropriate 3-4 
nucleotide extensions. When run on a gel, these reactions 
will show one, or only a few, bands per lane, giving an 
unequivocal determination of band identity. 

Since the present technology makes use of end labelled pri- 
mers for visualization, the technology can be used, both with 
standard technologies involving radioactivity, or with fluo- 
rescent labelled primers, without the need for further opti- 
misation. 

The invention also pertains to methods for detecting differences 
between expression level(s) in cells which have been 
subjected to different conditions, methods for diagnosing 
disease, and methods related to "bioinformatics" wherein are 
used a combination of output from the above-disclosed method 
and data obtained by computer simulation of corresponding 
treatment of well-defined stretches of nucleic acids. 

A separate part of the invention pertains to a novel method 
for performing reverse transcription which yields 
considerably enhanced quality in the reversely transcribed 
material. Means for carrying out this separate part of 
the invention are also disclosed. 



In the following is given a short discussion of terms used 
in the present application: 

"A sub-divided library of amplified cDNA fragments" is in the 
present context a library of amplified cDNA fragments which 
is split into a number of separate pools, each pool being 
defined by the sequences of the termini of the amplified 
fragments. For example, one pool may contain amplified frag- 
ments which are all characterized by having the sequence 5'- 
Com-AGC- in one of the strands, whereas another pool contains 
amplified fragments having the sequence 5'-Com-AAT in one of 
the strands. For a discussion of the meaning of "Con" and 
"Com", cf. below. 

"A normalised library" is a library containing substantially 
equal representation of each mRNA, i.e. approximately the 
same number of copies of each mRNA. 

"Reverse transcription" has its usual meaning in the art, 
i.e. synthesis of DNA using RNA as a template and effected by 
an enzyme having reverse transcriptase activity. 

"Adapter fragment" is intended to mean a nucleic acid sequen- 
ce containing a known sequence which can be used as template 
for a primer in a subsequent molecular amplification proce- 
dure such as PCR. The adapter fragment is further characte- 
rized by its ability to become integrated at the end of a 
cDNA fragment which has previously been cleaved with a 
restriction endonuclease in step c). In most cases, the 
restriction endonuclease leaves fragments having "sticky ends", 
to which the adapter fragment will anneal readily, and there- 
after the adapter fragment becomes ligated to the cDNA by the 
action of a DNA ligase. 
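
The cleavage that precedes adapter ligation can be simulated on a sequence string. The sketch below is a simplified illustration, not the procedure of the invention: it handles only the top strand, and it assumes EcoRI (recognition site GAATTC, cleaved after the first G) and an invented cDNA sequence purely as examples. The function name `digest` is hypothetical.

```python
def digest(seq, site, cut_offset):
    """Cut seq (top strand, 5'->3') at every occurrence of site.

    cut_offset is the number of bases of the site that stay on the
    upstream fragment (1 for EcoRI, G^AATTC).
    """
    fragments, start = [], 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments

ECORI_SITE = "GAATTC"   # assumed example enzyme, not named in the text
ECORI_CUT = 1
cdna = "ATGCCGAATTCTTAGGCGAATTCAAT"  # invented example sequence

fragments = digest(cdna, ECORI_SITE, ECORI_CUT)
internal = fragments[1:-1]  # fragments cleaved at both ends

print(fragments)
print(internal)
```

Only the internal fragments carry a cleaved terminus at both ends, so only these can receive an adapter at each end and be amplified with adapter-specific primers.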



DETAILED DISCLOSURE OF THE INVENTION 



In the following, the impact of each of the steps will be 
discussed in detail, see Figures 1 and 7. 
Step a) 

The goal in step a) is to produce a mixture of first strand 
cDNA fragments which is optimized in its composition for 
carrying out the subsequent steps. A number of considerations 
apply: 

First of all, to reduce the "background noise", it is preferred 
that the annealing of cDNA primer to the RNA in step a) 
is performed under high stringency conditions, i.e. at a 
temperature above 50°C, thereby ensuring that a minimum of 
mismatches are introduced in the cDNA relative to the mRNA. 

Secondly, it is desirable to obtain copies of sequences which 
are derived from all parts of mRNA in order to obtain information 
relating to the translated part of the mRNA. Prior art 
methods for reverse transcription of eukaryotic material have 
often utilised poly-dT as cDNA primers. This strategy has, 
however, the disadvantage that the most efficiently reverse 
transcribed material is situated in the untranslated part of 
the genes of interest. Hence, the only parts of the mRNA 
which become "visible" after e.g. a PCR procedure will very 
often be derived from untranslated regions of the RNA. There 
are two reasons for this. First of all, the poly-dT 
approach has the consequence that the initiation point of 
reverse transcription is situated very far from e.g. the 
start codon relating to the operon in question. Secondly, the 
mRNA may include structures (e.g. "hairpin" structures due to 
intra-chain base-pairing) which block reverse transcription, 
and by always initiating reverse transcription at one terminus 
of a gene, such structures will statistically block 
reverse transcription of a number of translated regions. 

It is in the present invention preferred to ensure that cDNAs 
are produced in step a) which are representatives of the 
entire gene, including the translated regions. 
This can be obtained in a number of ways. If poly-dT priming 
(or a variation thereof) is used, it is preferred to perform 
the reverse transcription at an elevated temperature, e.g. in 
the range from about 45°C to about 95°C, and to use an enzyme 
having reverse transcriptase activity at said temperature. 
Normally the temperature will be higher than 45°C, e.g. at 
least 50°C, or even higher, e.g. at least 55°C, at least 
60°C, at least 65°C or even higher, e.g. at least 70°C. This 
approach has the effect that the elevated temperature ensures 
that e.g. hairpin structures are "stretched out" during the 
reverse transcription step, thereby avoiding the lack of 
reversely transcribed fragments upstream of such structures. 

Known enzymes having reverse transcriptase activity at such 
elevated temperatures are enzymes selected from the group 
consisting of DNA polymerases derived from thermophilic 
eubacteria, such as the polymerases Taq (Thermus aquaticus), 
Stoffel (Thermus aquaticus), Tth (Thermus thermophilus), 
Tfl/Tub (Thermus flavus), Tru (Thermus ruber), Tca (Thermus 
caldophilus), Tfil (Thermus filiformis), Tbr (Thermus 
brockianus), Bst (B. stearothermophilus), Bca (B. caldotenax YT-G), 
Bcav (B. caldovelox YT-F), FjSS3-B.1 (Thermotoga FjSS3-B.1), 
Tma (Thermotoga maritima), UlTma (T. maritima), Tli (T. 
litoralis), Tli exo- (T. litoralis), 9°N-7 (Thermococcus sp.), BG-D 
(Pyrococcus sp.), Pfu (P. furiosus), Pwo (P. woesei), Sac (S. 
acidocaldarius), Ssol (S. solfataricus), Tac (T. acidophilum), 
and Mth (Methanococcus voltae). 

One minor disadvantage of using these thermostable enzymes is 
that they have a tendency to be relatively ineffective compared 
to the "traditional" non-thermostable reverse transcriptases. 
Hence, especially if priming of the reverse 
transcription is not limited to the use of poly-dT primers, 
it is according to the invention possible to use non-thermostable 
reverse transcriptases. Thus, in other preferred 
embodiments, the reverse transcription is carried out at a 
temperature in the range from about 25°C to about 55°C by use 
of an enzyme having reverse transcriptase activity at said 
temperature. Normally the temperature will not exceed 50°C, 
and usually it will be lower, such as at most 47°C, at most 
45°C, at most 43°C, at most 40°C, or at most 35°C. The 
reverse transcriptase can e.g. be selected from the group 
consisting of the reverse transcriptases from AMV (Avian 
Myeloblastosis Virus), M-MuLV (murine M-MuLV pol gene), and 
HIV-1 (HIV virus). 

According to the invention, the most preferred way of carrying 
out step a) is to carry out reverse transcription in two 
subsequent steps, the first step comprising carrying out 
reverse transcription at the temperature conditions described 
above for non-thermostable enzymes, and the second step 
comprising carrying out reverse transcription at the temperature 
conditions described above for thermostable enzymes. 
Normally this can be accomplished by having two non-identical 
enzymes present in the reverse transcription reaction, especially 
because the non-thermostable enzyme will be inactivated 
by the increase in temperature which is introduced when 
going into step 2. Of course, the enzymes can be added for 
each reaction step, but it is preferred that both enzymes are 
present from the start of the reaction. 
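
The temperature constraints on the two-step reaction can be written down as a small consistency check. This is a sketch under stated assumptions: the ranges are those given in the text (about 25°C to 55°C for the first step, about 45°C to 95°C for the second), the requirement that the second step be warmer than the first reflects the word "elevated", and the function name and example temperatures are illustrative only.

```python
FIRST_STEP_RANGE_C = (25, 55)    # non-thermostable enzyme, per the text
SECOND_STEP_RANGE_C = (45, 95)   # thermostable enzyme, per the text

def check_two_step(first_c, second_c):
    """Return True if both temperatures fit the ranges quoted in the text
    and the second step is warmer than the first."""
    lo1, hi1 = FIRST_STEP_RANGE_C
    lo2, hi2 = SECOND_STEP_RANGE_C
    return (lo1 <= first_c <= hi1
            and lo2 <= second_c <= hi2
            and second_c > first_c)

# Example temperatures chosen for illustration, not prescribed by the text.
print(check_two_step(42, 70))
print(check_two_step(60, 70))
```

The two ranges overlap between 45°C and 55°C, so the explicit comparison of the two steps is what guarantees that the second step is in fact the elevated one.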

It is especially preferred that the activity of the enzyme 
which is active in the first step is substantially abolished 
in the second step (e.g. as a consequence of temperature 
denaturing of that enzyme), or expressed otherwise, that in 
the second step the enzyme used in the first step is substan- 
tially inactive. In general, it is preferred that the enzymes 
used in each step are substantially more active in the rele- 
vant temperature range than the one wherein the other enzyme 
is used. 

In a preferred embodiment the reaction mixture with the 
sample comprises a cDNA primer, said cDNA primer being suffi- 
ciently complementary to the target RNA present in the sample 
to hybridize therewith and initiate synthesis of a single 
stranded cDNA molecule complementary to said target RNA and 
the reaction mixture comprises an appropriate buffer which 
comprises all four deoxyribonucleoside triphosphates and a 
divalent cation selected from the group of Mg2+ and Mn2+ in a 
concentration between 0.1 and 5 mM. 

In fact, it is believed that the above strategy for conducting 
reverse transcription by use of two enzymes having different 
temperature optima and of which one has a temperature 
optimum at which impeding structures in the RNA are "stretched 
out", is novel and inventive in its own right. 

Preferred combinations of enzymes in this embodiment of the 
invention are that the enzyme effecting reverse transcription 
in the first step is M-MuLV, AMV, HIV-1 and/or the enzyme 
effecting reverse transcription in the second step is Tth or 
Taq. 



An object of the method of the invention is to obtain a 
subdivision of the cDNA produced. When the mRNA is derived 
from a eukaryotic system, the at least one cDNA primer may 
include an oligo or poly dT tail in the 5'-end, having the 
general formula 5'-dT_{n2}-V_{n3}-N_{n4}-3', wherein dT is 
deoxythymidinyl, V is A, G, or C, N is A, G, C, or T, n2 is an 
integer > 1, n3 is 0 or 1, if n3 is 0 then n4 is 0, and if n3 is 1 
then n4 is an integer ≥ 0. It will be clear that when n3 and 
n4 are both zero, then the primer is an ordinary poly- or 
oligo-dT cDNA primer. However, when n3 is 1, then the primer 
is in fact a primer composition which will be able to prime 
the reverse transcription of any mRNA having a poly-A tail. 

If the original sample of RNA is subdivided, and each sub- 
pool is subjected to reverse transcription which uses one of 
the possible primers having the above formula where n3 is 1, 
then the result is a number of single stranded cDNA pools 
which are each different from each other in the 5'-end. 

For example, when n3 is 1, 3 x 4^{n4} groups of cDNA primers are 
used, each group being distinct from any one of the other 
groups with respect to the structure -V_{n3}-N_{n4}-. In such an 
embodiment the pool of mRNA is conveniently subdivided into 
3 x 4^{n4} aliquots which are each subjected separately to step 
a) utilising one of the 3 x 4^{n4} groups of cDNA primers, 
thereby obtaining a subdivision of the first strand cDNA into 
3 x 4^{n4} separate pools. Normally n4 will be 0 or 1, resulting 
in the provision of 3 or 12 pools, respectively. 
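
The pool counts above can be reproduced by enumerating the anchored primers directly. The following is a minimal sketch for the case n3 = 1; the dT tail length of 12 is a hypothetical value chosen for illustration, and the function name `anchored_primers` is an assumption.

```python
from itertools import product

def anchored_primers(n2, n4):
    """Enumerate primers 5'-dT_{n2}-V-N_{n4}-3' for the case n3 = 1."""
    primers = []
    for v in "AGC":                              # V is A, G or C (never T)
        for tail in product("AGCT", repeat=n4):  # N is any of the four bases
            primers.append("T" * n2 + v + "".join(tail))
    return primers

N2 = 12  # hypothetical dT tail length, for illustration only

for n4 in (0, 1):
    primers = anchored_primers(N2, n4)
    print(n4, len(primers))  # number of primer groups, i.e. 3 * 4**n4
```

Excluding T at the V position forces the primer to anneal at the junction between the poly-A tail and the transcript body, which is what makes each primer group select a distinct subset of mRNAs.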

When the starting material is not eukaryotic or when it is 
not the intention to necessarily set out from the part of the 
transcribed gene which is most remote relative to the translation 
start codon, the at least one cDNA primer does not 
include a poly or oligo dT tail in the 5'-end, or, alternatively, 
at least two cDNA primers are used of which at least 
one includes a poly or oligo dT tail in the 5'-end and of 
which at least one second does not include a poly or oligo dT 
tail in the 5'-end. Preferably, the cDNA primer which does 
not include a poly or oligo dT tail in the 5'-end has the 
following structure 

5'-N_x-TTA-3' or 5'-N_x-CTA-3' or 5'-N_x-TCA-3', 

wherein N is A, G, T, or C, and x is an integer 1 < x ≤ 20. 
It will be clear that this corresponds to cDNA priming set- 
ting out from any translation stop codon. As for the above 
embodiments utilising a poly- or oligo-dT tailed primer, it 
is, by preparing primers having all possible permutations 
represented in the group N x , possible to compose the primers 
so as to correspond to any possible sequence preceding a stop 
codon, thereby ensuring priming of all sequences having a 
stop codon in their sequence. 
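
The relation between the three primer 3'-ends and the stop codons can be checked with a minimal sketch; the helper name is illustrative and sequences are written 5' to 3' in the DNA alphabet.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

# Stop codons as they appear in the cDNA (sense) sequence.
STOP_CODONS = ["TAA", "TAG", "TGA"]

# A primer annealing to the mRNA must end in the reverse complement.
primer_3prime_ends = [reverse_complement(codon) for codon in STOP_CODONS]
print(primer_3prime_ends)  # ['TTA', 'CTA', 'TCA']
```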

Step b) 

This step is carried out by methods well known in the art. It
is, however, preferred that step b) is carried out under
conditions which minimize the formation of mismatches between
nucleotides in the first and second cDNA strands. The double
stranded cDNA procedure can be performed according to
standard methods as described in Sambrook et al. (1989). However,
since standard polymerases can have difficulty in
synthesising regions containing secondary structures or with high
GC-content, thermostable RNase H (Hybridase Thermostable RNase
H, US 5,268,289) and thermostable rBst DNA polymerase from
Bacillus stearothermophilus help overcome some of the
limitations that standard polymerases (low temperature polymerases)
suffer from.



Step c) 

In one embodiment of the invention the ligated cDNA fragments
obtained in step b) are subjected to a molecular amplification
procedure so as to obtain amplified cDNA fragments,
wherein is used a set of amplification primers having the
general formula

5'-Con_3-N_n1-3'     (I)

wherein Con_3 is a sequence identical to either Con_1 or
Con_2 or both, N is A, G, T or C, and n1 is an integer ≥ 0,
wherein at least one set of primers has the general
formula I where n1 > 0, said at least one set being capable
of priming amplification of any nucleotide sequence
complementary in its 5'-end to Con_1 or Con_2.

In another embodiment, after the preparation and optional
subdivision of the mRNA, each of the different pools of cDNA
is digested with at least one restriction enzyme to produce
fragments of a size which can be separated using an
appropriate size fractionation method.



The choice of restriction enzyme is based largely on the
frequency of the cleavage sites in a given cDNA pool. Too
many cleavage sites in each cDNA fragment will result in too
small fragments, and vice versa. Optimally, the at least one
enzyme should cleave every cDNA to yield fragments of the
desired size. Statistically, it is not possible to cleave
every cDNA, but on the other hand a very large percentage can
be cleaved by choosing a suitable enzyme or combination of
enzymes. It is preferred that the method of the invention
utilises at least one restriction enzyme chosen so as to
ensure that at least 60% of cDNAs are cleaved, but higher
percentages such as at least 65%, at least 70%, at least 75%,
at least 80%, or even at least 85% are more preferred.

Preferably the invention should use restriction enzymes that 
leave protruding ends (sticky ends) at the termini of the DNA 
after digestion in step c) , since this greatly facilitates 
the introduction of the adapter fragments in step d) . 

As will appear from the above, the frequency with which the
restriction endonuclease cleaves is important. The at least
one restriction enzyme is preferably chosen so as to cleave
each complete cDNA into an average of about 3 fragments. It
will be understood that some cDNAs obtained from preceding
steps will not be cut at all (although this is a rare
incidence when the restriction enzyme(s) is/are carefully chosen)
whereas others will be cut with a high frequency. It has been
found that use of a rare 4 base cutter as the at least one
restriction endonuclease (such as the 4 base cutters AciI,
AluI, BfaI, BstUI, Csp6I, DpnI, DpnII, HaeIII, HhaI, HinP1I,
HpaII, MboI, MnlI, MseI, MspI, NlaIII, RsaI, Sau3AI, TaiI,
TaqI, and Tsp509I) ensures the optimum performance of the
inventive method. By use of such a rare 4 base cutter, the
use of only 1 restriction enzyme in step c) is sufficient and
results in superior output.

Alternatively, a combination of restriction endonucleases can
be used wherein a balance of e.g. 6 base cutters and 4 base
cutters ensures a reasonable distribution of fragment sizes.
For instance the use of a first restriction enzyme (e.g. a 6
base cutter) which statistically cleaves at least 20% of
complete cDNA derived from the mRNA sample into two
subfragments, and of a second restriction enzyme (e.g. a 4 base
cutter) which statistically cleaves at least 50% of said
subfragments into 3 further subfragments, will also result in
a series of fragments suitable for later size fractionation.
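
The statistical argument can be made concrete with a rough estimate. The sketch below assumes a random sequence with equal base frequencies and treats positions as independent, which real cDNA does not strictly satisfy; the 1500 bp length is an arbitrary illustrative value, not a figure from this description.

```python
def fraction_cleaved(length_bp, site_len):
    """Estimate the probability that a sequence contains at least one site.

    Assumes equal base frequencies and independent positions.
    """
    positions = max(length_bp - site_len + 1, 0)
    p_site = 0.25 ** site_len  # chance that a given position starts a site
    return 1.0 - (1.0 - p_site) ** positions

# Compare a 4 base cutter with a 6 base cutter on a 1500 bp cDNA.
for site_len in (4, 6):
    print(site_len, round(fraction_cleaved(1500, site_len), 3))
```

Under these assumptions a 4 base cutter cleaves nearly every cDNA of this length, while a 6 base cutter leaves a large share uncut, which is why the two enzyme classes are balanced against each other.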



Step d) 

The mixture (s) obtained in step c) are then subjected to a 
reaction wherein adapter fragments are added to both ends of 
the double stranded cDNA fragments obtained. As mentioned 
above, this part of the procedure is greatly facilitated by 
the cleaved cDNA fragments having protruding "sticky" ends, 
because pre-designed adapter fragments which fit to these 
protruding ends can easily be prepared. 

The adapter (or anchor) fragments are added to the cleaved
fragments in order to obtain "order in chaos" in the
subsequent step. By adding known sequences to the termini of the
cleaved fragments, one creates targets for specific
amplification primers which can be designed specifically with the
aim of amplifying sequences complying to the adapter
fragments. The material thus obtained (primary template) can be
pre-amplified, using primers complementary to the ligated
adaptor sequences, giving rise to secondary template. The
pre-amplification of primary template allows virtually
unlimited amounts of template to be produced from one RNA
preparation, avoiding the need for repeated isolations.

The adaptor sequence is thus selected so as to serve as the
starting point for DNA polymerisation in e.g. a PCR reaction.
The adaptor sequences are constructed in such a way that the
specific endonuclease sites are not regenerated after
ligation of said adaptor.
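
Whether a given adaptor design regenerates the restriction site can be checked by inspecting the ligation junction. The sketch below is only an illustration: the adaptor ends are hypothetical, and the example assumes a TaqI-like site TCGA cut after the first base, so that the top strand of a cleaved fragment starts with CGA.

```python
def regenerates_site(adaptor_end, fragment_start, site):
    """Return True if ligation recreates `site` across the junction.

    adaptor_end: 3'-terminal bases of the adaptor top strand (hypothetical).
    fragment_start: 5'-terminal bases of the cleaved fragment top strand.
    """
    junction = adaptor_end + fragment_start
    cut = len(adaptor_end)  # index of the ligation point in `junction`
    for i in range(len(junction) - len(site) + 1):
        spans_junction = i < cut < i + len(site)
        if spans_junction and junction[i:i + len(site)] == site:
            return True
    return False

# An adaptor ending in T would rebuild TCGA; one ending in C would not.
print(regenerates_site("GAT", "CGA", "TCGA"))
print(regenerates_site("GAC", "CGA", "TCGA"))
```

A design that returns False here stays ligated even if active restriction enzyme is still present in the reaction.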

In a preferred embodiment at least one termination fragment
is also ligated to the 3'-end of single strands of cleaved
cDNA fragments, said at least one termination fragment
introducing a block against DNA polymerization in the 5'→3'
direction setting out from the at least one termination fragment
and said at least one termination fragment being unable to
anneal to any primer of the at least two primer sets in step
e) during the molecular amplification procedure.

The above is a very important procedure when combined with
the use of detection effected by labelled primers in the
amplification step, wherein only one member of the pair of
primers is labelled whereas the other is designed to split up
the amplified products according to their base composition
adjacent to the adapter fragment. One important feature is
that a single stranded cDNA fragment which has been provided
with a termination fragment will not be amplified, because no
primers will be able to anneal to the products of a first
round polymerisation wherein such a fragment was the
template, see Figure 7. Secondly, the approach opens for the
possibility of removing background "noise" in a subsequent
detection phase.

Normally, the at least one termination fragment comprises or
is a chemically modified nucleotide sequence, such as for
instance a nucleotide sequence which comprises a
dideoxynucleotide in the 3'-end; this termination technique is
well-known from e.g. the chain-termination sequencing technique
according to Sanger. Under normal circumstances, the
dideoxynucleotide should, according to the invention, be covalently
attached to the nucleotide strand so as to avoid loss of the
dideoxynucleotide during subsequent rounds of amplification.
Superior stabilisation is attained if the dideoxynucleotide
is phosphorylated.

As mentioned above, the ligation of adapter and/or termina- 
tion fragments to the cleaved cDNA fragments in step d) is 
conveniently achieved by annealing the adapter fragments to 
sticky ends of the cDNA resulting from the cleavage in step 
c) and subjecting the product to the action of an enzyme 
having DNA ligase activity. Any suitable DNA ligase known in
the art can be used.

Step e)

Step e) of the method of the invention results in the final
sorting of the modified cDNA fragments from step d). As steps
b), c) and d) are combined in the broadest embodiment of the
invention, step e) corresponds to step c) of this embodiment
described above.

The primers having the structure of formula I (step c) or II
(step e) are designed so as to selectively amplify
synthesized double stranded cDNA fragments obtained in step b) or
predefined subsets of the adapted fragments obtained in step
d). A number of ways in which this can be done may be envisaged,
but the main strategy is to prime amplification in a series of
separate reactions where the nucleotide sequence of one
primer in one reaction ensures that the amplified products of
that reaction are different from those obtained from any of
the other reactions and that all the reactions together result
in amplification of all fragments obtained from step b) or d),
respectively.

Even though the at least one set of amplification primers of
formula I or II has an n1 which is > 0, it is preferred
that n1 = 1, n1 = 2, n1 = 3, or n1 = 4 in one of the primers,
because the number of primer fragments to be used in the
reactions in order to cover all possible nucleotide stretches
adjacent to the Con or adapter fragment is then easily manageable.
For instance, if n1 = 5, it would be necessary to use 4^5 = 1024
different primers in order to obtain amplification of all
possible nucleotide sequences adjacent to the relevant
adapter fragment, and since the preferred embodiment of the
invention requires that each such primer is used in a
separate reaction, the work involved would be problematic.
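
The growth of the primer set with n1 can be sketched as follows. The core sequence is the common part of the labelled primers listed for Fig. 3; the function name is an illustrative choice.

```python
from itertools import product

def selective_primers(core, n1):
    """Extend a common primer core with every possible N(n1) stretch."""
    return [core + "".join(ext) for ext in product("ACGT", repeat=n1)]

CORE = "GACTGCGTACCGATCA"  # common part of the labelled primers in Fig. 3

# Number of separate reactions needed to cover all extensions: 4**n1.
for n1 in range(6):
    print(n1, len(selective_primers(CORE, n1)))
```

The count is 4, 16, 64 and 256 for n1 = 1 to 4 and reaches 1024 at n1 = 5, which illustrates why short extensions are preferred.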

It is also preferred that in one of the primers n1 = 0, and it
is especially preferred that this primer is labelled, in
order to facilitate determination of the amplified fragments.
Hence, in the most preferred embodiments of the invention,
the adapted cDNA fragments are amplified in a number of
separate reactions wherein a labelled primer is used (which is
normally identical in all reactions) and at least one
non-labelled primer which is a member of the set of primers
described above where n1 > 0. It is preferred that this set of
amplification primers of formula I or II wherein n1 > 0
comprises all possible combinations and permutations of A, G, T,
and C in the group N_n1, since this will ensure that all
possible cDNA fragments can be amplified by the set.

Hence, the ligated cDNA fragments are sub-divided into a
number of pools prior to the molecular amplification in step
e), and each pool is subjected to the amplification using a
subset of the set of amplification primers, and in the most
preferred embodiment the ligated cDNA fragments of step d)
are subdivided into 4^n1 pools which are each subjected
separately to step e) wherein is used one labelled amplification
primer as described above (n1 = 0) and one primer from the set
of amplification primers as defined above (n1 > 0), said one
primer being distinct from any one of the primers used for
amplifying any of the other pools. By using this approach,
the originally reverse transcribed and cleaved cDNA fragments
are subdivided into 4^n1 pools which can each be subjected to
further steps.



Further steps and applications 



The material obtained from the above-described series of
reactions can now be utilised in a number of ways. Normally,
a further step of separating amplified fragments obtained
from the molecular amplification procedure is performed. This
yields a mixture of amplified fragments which are separated
e.g. by size separation, by mobility in a gel electrophoresis
or by any suitable chromatographic method. Furthermore, a
step of identification (e.g. by visualization of these
separated fragments) is normally carried out for "book-keeping
purposes"; the separated mixture of fragments will normally
be compared to some kind of reference which may be material
derived from the same or another cell type.

Visualization of the separated fragments can, as mentioned
above, be achieved by one of the primers in the amplification
reaction being labelled, but other methods are of course
available. For instance, a specifically labelled probe which
e.g. binds to one of the adapter sequences will visualise the
fragments, but also labelled nucleotides which have been
incorporated in the fragments during the amplification
procedure (e.g. a PCR) will of course be a suitable means for
detection (e.g. by incorporating radioactive or fluorescent
alpha dNTP into the cDNA fragment during PCR, where N = A, C,
T, U or G).

However, it is preferred that visualisation of specific RNA
Derived Fragments (RDFs) is achieved using primers which are
radioactively or fluorescently labelled and are homologous to
the adaptors. The comparatively high annealing temperatures
(touch-down from 65°C to 56°C) which are preferably used
ensure that polymerisation events will predominantly
originate from perfect priming of adapter sequences and adjacent
selective bases. Band intensities are largely a function of
initial template concentration, whereas band intensities of
the original Differential Display methods are dependent on
the quality of the match between the individual template and
primer. The visualisation of rare mRNAs using the present
inventive methods will therefore be less hampered by the
over-representation of signal from highly abundant mRNAs. In the
case of arbitrary priming, by contrast, mismatch amplification
of abundant RDFs always out-competes the amplification of rare
fragments, even those which base pair perfectly. Our
experiments suggest that as few as
100 molecules can be routinely detected in a given template.
This corresponds to less than 1 transcript per cell in the
original tissue.

One interesting part of the invention relates to the use of
the above-described methods in bioinformatics. In short,
known DNA sequences are inputted into a computer database,
and on the basis of such sequences a comparison with a
real-life run of the above-described methods can be performed. In
this way, bands in a gel obtained from the methods of the
invention can be unambiguously identified with respect to
sequence, origin and even functionality. Hence, this part of
the invention pertains to a method for determining the
presence of an expression product in a cell or group of cells,
the method comprising providing an RNA-containing sample from
the cell or group of cells and subjecting the sample to the
method described above, and thereafter performing a
comparison of the thus identified amplified cDNA fragments with a
database output, said database output comprising a
computer-generated list of molecular weights of restriction DNA
fragments of known sequences, said list being prepared by

- inputting and storing DNA sequence data in a database as
  virtual DNA sequences (these can be obtained and updated
  regularly from any database containing information about
  gene sequences from the relevant organism or cell type),

- subsequently simulating cleavage of the virtual DNA
  sequences with the at least one restriction nuclease and
  storing the resulting simulated cleavage products as
  virtual cleaved DNA fragments (such simulation is
  relatively uncomplicated, since the recognition and cleavage
  patterns of a large number of restriction enzymes are
  already known),

- simulating ligation to the virtually cleaved fragments of
  the at least two adapter fragments and storing the
  results as virtually ligated DNA fragments (again, this
  merely requires that input is provided of the structure
  of adapter fragments used in the real-life process),

- for each individual combination of primers used in step
  e) grouping the virtually ligated DNA fragments
  susceptible to amplification by said combination of primers in
  the same group,

- determining, in each group, the absolute and/or relative
  molecular weight of each virtually ligated DNA fragment,
  and

- outputting the content of each group in the form of a
  list comprising the absolute and/or relative molecular
  weights of the virtually ligated fragments in the group.
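
The steps listed above can be sketched in a few lines. This is a deliberately simplified illustration and not the database system of the invention: it assumes a single restriction enzyme, one adapter length for both fragment ends, selection by the n1 bases next to the site remnant at one end only, and fragment lengths in base pairs in place of molecular weights. The sequences, names and the adapter length of 16 are made up for the example.

```python
import re
from collections import defaultdict

def digest(seq, site, cut_offset):
    """Cut seq at every occurrence of site; cut_offset is the cut
    position counted from the first base of the site."""
    pattern = "(?=" + re.escape(site) + ")"
    cuts = [m.start() + cut_offset for m in re.finditer(pattern, seq)]
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]

def virtual_display(sequences, site, cut_offset, adapter_len, n1=1):
    """Group virtual adapter-ligated fragments by their selective bases."""
    remnant = len(site) - cut_offset  # site bases left on the fragment
    groups = defaultdict(list)
    for name, seq in sequences.items():
        # Only internal fragments have a restriction end on both sides.
        for frag in digest(seq, site, cut_offset)[1:-1]:
            selective = frag[remnant:remnant + n1]
            if len(selective) < n1:
                continue
            size = len(frag) + 2 * adapter_len
            # Keeping the name maintains the link to the source sequence.
            groups[selective].append((size, name))
    return {key: sorted(bands) for key, bands in sorted(groups.items())}

demo = {  # made-up sequences for illustration
    "geneA": "GGTCGAACCTTTCGAGG",
    "geneB": "ATCGACGGGGGTCGATT",
}
result = virtual_display(demo, "TCGA", 1, adapter_len=16)
for selective, bands in result.items():
    print(selective, bands)
```

Each key of the result corresponds to one primer combination, and each entry is a predicted band that can be compared with the bands seen in the matching lane.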

It is preferred that a link is maintained between each member
of the output list and the original sequence from which such
a member has been derived. This can e.g. be done by linking
the input DNA sequence data to data relating to the genetic
origin of the DNA sequence data and optionally to data
relating to functional features relating to the genetic origin
and thereafter maintaining the information as a pointer back
in the system to said sequence. Hence, the output indication
will conveniently further comprise information about the
genetic origin of the virtually ligated DNA fragment and
optionally information about functional features associated
with the genetic origin.

For ease of use of such a bioinformatic system, it is
normally necessary that 1) either the comparison is performed by
inputting the identified amplified cDNA fragments in a format
which allows automated comparison with the database output,
or 2) the database output is outputted in a format which
allows for direct comparison between the separated amplified
cDNA fragments and the database output. For instance, if the
visualized and separated cDNA fragments from step e) have
been run on a gel, it will be possible either to read a
digital reproduction of the gel pattern into the computer and
let the computer compare this input with the computer
generated pattern, or alternatively, to output the computer
generated pattern in such a manner that it resembles an
electrophoresis gel pattern.



Another part of the invention pertains to the use of the
inventive method for comparing expression levels in different
cells. One way of doing this is to determine the change in
expression, compared to the expression in a reference cell or
reference group of cells, of an expression product in a cell
or group of cells which has been subjected to a first set of
conditions influencing the expression pattern of said cell or
group of cells, said reference cell or group of cells being
subjected to a second set of conditions, the method
comprising providing an RNA-containing sample from the cell or
group of cells and subjecting the sample to the method of the
invention for sub-division, thereby obtaining data describing
the amplified cDNA fragments derived from the sample,
providing reference data describing amplified cDNA fragments
derived from an RNA-containing reference sample from the
reference cell or reference group of cells, the reference
data being obtained by having previously subjected the
reference sample to the method of the invention, subsequently
performing a comparison of the data and the reference data to
identify the cDNA fragments which are expressed at different
levels in the two data sets, and thereafter using the
differentially expressed cDNA fragments to determine which
expression products are subject to a change in expression level. In
other words, the method of the invention is carried out twice
on the basis of two different RNA samples derived from cells
subjected to differing conditions.
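
The comparison of the data and the reference data can be sketched as below. The band keys (selective bases, apparent size in bp), the intensity values and the two-fold threshold are all hypothetical choices made for the illustration; the description itself does not prescribe a threshold.

```python
def differential_bands(sample, reference, min_fold=2.0):
    """Report bands whose intensity differs between two data sets.

    sample, reference: dicts mapping (selective_bases, size_bp) to a
    relative band intensity. min_fold is an assumed cut-off.
    """
    changed = {}
    for band in sorted(set(sample) | set(reference)):
        s = sample.get(band, 0.0)
        r = reference.get(band, 0.0)
        if s == 0.0 and r == 0.0:
            continue
        if r == 0.0:
            changed[band] = "up (absent in reference)"
        elif s == 0.0:
            changed[band] = "down (absent in sample)"
        elif s / r >= min_fold:
            changed[band] = "up"
        elif r / s >= min_fold:
            changed[band] = "down"
    return changed

treated = {("A", 210): 8.0, ("A", 305): 1.0, ("C", 180): 2.0}
untreated = {("A", 210): 2.0, ("A", 305): 1.1, ("C", 250): 3.0}
for band, call in differential_bands(treated, untreated).items():
    print(band, call)
```

Bands flagged in this way are the candidates that would then be isolated and sequenced, or looked up in the database output, to identify the underlying expression product.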

Normally, the data and reference data are selected from the
group consisting of the apparent molecular weights of the
amplified DNA fragments, the M_r of the amplified DNA
fragments, the absolute amount of the amplified DNA fragments,
and the relative amount of the amplified DNA fragments. The
reference data can further be extracted from a database
containing the reference data defined above and optionally
further information relating to the genetic origin of each
amplified cDNA fragment from the reference.

Related to the above, the invention also allows for diagnosis
of disease which is characterized by a deviating (increased
or reduced) expression level of at least one expression
product in at least one cell type, the method comprising
providing an RNA-containing sample derived from the at least one
cell type, subjecting the sample to the method of the
invention thereby obtaining data describing the amplified cDNA
fragments derived from the sample, providing reference data
describing amplified cDNA fragments derived from an
RNA-containing reference sample derived from the same type of cell
from a subject not suffering from the disease, the reference
data being obtained by having previously subjected the
reference sample to the method according to the invention, and
subsequently performing a comparison of the data and the
reference data with respect to those cDNA fragments which are
known to be related to the disease, and assessing whether a
significant difference in the data and reference data exists
so as to establish whether the expression level of the
expression product deviates or not.

As for the embodiment above, also here the data and reference
data are selected from the group consisting of the apparent
molecular weights of the amplified DNA fragments, the M_r of
the amplified DNA fragments, the absolute amount of the
amplified DNA fragments, and the relative amounts of the
amplified DNA fragments, and also here the reference data can
be extracted from a database containing the reference data
defined above and optionally further information relating to
the genetic origin of each amplified cDNA fragment from the
reference.



Further, the invention provides a method for treatment of a
disease which is characterized by a deviating (increased or
reduced) expression level of at least one expression product
in at least one cell type, the method comprising providing an
RNA-containing sample derived from the at least one cell
type, subjecting the sample to the method of the invention
thereby obtaining data describing the amplified cDNA
fragments derived from the sample, providing reference data
describing amplified cDNA fragments derived from an
RNA-containing reference sample derived from the same type of cell
from a subject not suffering from the disease, the reference
data being obtained by having previously subjected the
reference sample to the method according to the invention, and
subsequently performing a comparison of the data and the
reference data with respect to those cDNA fragments which are
known to be or suspected of being related to the disease, and
assessing whether a significant difference in the data and
reference data exists so as to establish whether the
expression level of the expression product deviates or not.

If the expression product is reduced, the disease may be
treated by delivering the expression product; if the
expression product is increased, the disease may be treated by
delivering an inhibitor (e.g. an antibody) against the
expression product. The scope of the present invention includes
an expression product identified by the method of the
invention as such, as well as methods for treating a disease which
methods have been provided by means of the method of the
invention.

The mixtures of amplified fragments obtained from step e) of
the method of the invention may also be used for preparing a
surface (chip) coated with cDNA fragments. This can be done
by

- subjecting an RNA-containing sample to the subdivision
  method of the invention including separation steps, and

- transferring the separated amplified cDNA fragments to a
  chip surface adapted to stably bind the separated
  amplified cDNA fragments while maintaining the spatial
  relative distribution pattern thereof.

Alternatively, such a chip can be prepared by

- subjecting an RNA-containing sample to the method of the
  invention without performing the separation, and thereafter

- separating, by electrophoresis, the thus obtained
  amplified cDNA fragments on a particular surface adapted to
  stably bind the separated amplified cDNA fragments while
  maintaining the relative distribution pattern after
  electrophoresis. In this embodiment, the electrophoresis
  is preferably in the form of microelectrophoresis.

Transfer to the surface is preferably accomplished by an
electrophoretic blotting technique, and/or by well-known
photo-activated organic or inorganic chemistry coupling
techniques.

The invention also pertains to a surface obtainable by the 
above-mentioned method for the preparation thereof. Such 
surfaces are considered novel and inventive, since known "DNA 
chips" rely on specific introduction of an array of nucleic 
acid fragments of known structure, whereas the present method 
provides for "a semi-array" containing cDNA fragments charac- 
terizing a specific "situation" for a specific cell type, 
according to Figure 3 . 

Such a surface can i.a. be used for screening for genes 
within a gene family. The "array chip" is provided and there- 
after a labelled probe (which is a representative of a gene 
family) is allowed to hybridize to the chip under low strin- 
gency i.e. under conditions as described at pages 94-106 in 
"Nucleic acid hybridisation. A practical approach" edited by 
BD Hames & S J Higgins, IRL Press. A number of fragments 
coupled to the chip will hybridize to the probe, and these 
fragments can subsequently be identified, isolated and se- 
quenced/characterized in order to determine whether they are 
representatives of the same gene family. 

Another use of such "semi-arrays" is for determining the
difference in expression pattern between a first cell or type of
cells and a second cell or type of cells, the method
comprising providing samples of labelled RNA or cDNA from the first
and second cells or cell types and subsequently contacting
each of these samples with a chip surface as described above,
and subsequently detecting the amount and distribution of
bound labelled RNA or cDNA from each sample.

Under all circumstances, the chip surface with the cDNA bound 
thereto can e.g. be produced by the methods described in EP-0 
654 061. 

Yet another part of the invention pertains to a method for
screening for interactions between a pre-selected protein and
a polypeptide fragment, the method comprising preparing a
sub-divided library of amplified cDNA fragments resulting
from step e), optionally adapting the terminals of the
members of the library so as to facilitate insertion in a
vector, inserting the fragments into vectors, transforming a
population of suitable host cells with the vectors, culturing
the host cells under conditions which enable expression of
correctly inserted cDNA fragments by the host cell, and
subsequently assaying polypeptide fragments encoded by the
inserted cDNA fragments for interaction with the pre-selected
protein.

One convenient way of achieving this is by way of a
two-hybrid technique, wherein the host cells are eukaryotic cells
(such as fungal cells, especially yeast cells) which are
mated or transfected with nucleic acid material encoding the
pre-selected protein, successful mating/transfection of the
cell(s) resulting in a cell or cells wherein the interaction
between the pre-selected protein and a polypeptide fragment
gives rise to a detectable signal.

Such methods have recently attracted a great deal of
attention, i.a. as a consequence of the disclosure in
Fromont-Racine et al., Nature Genetics 16, 277-282 (1997), which is
incorporated by reference herein.



One convenient system for providing the detectable signal is
by use of Green Fluorescent Protein, disclosed in EP-A-0 569
170, wherein changes in fluorescent spectrum due to
interactions are used as reporter.

Finally, the invention pertains to a composition for use in 
reverse transcription of RNA, the composition comprising 

a) a first enzyme having reverse transcriptase activity
at temperatures not exceeding 55°C,

b) a second enzyme having reverse transcriptase activity
at elevated temperatures in the range of 45°C - 95°C (and
especially the temperatures discussed above for performing
reverse transcription at elevated temperatures),

said second enzyme having a substantially higher activity 
than said first enzyme in catalyzing reverse transcription at 
said elevated temperatures. It is preferred that the first 
enzyme has a substantially higher activity than said second 
enzyme in catalyzing reverse transcription at said tempera- 
tures not exceeding 55°C, and it is also preferred that the 
second enzyme has a substantially higher activity than said 
first enzyme in catalyzing reverse transcription at said 
temperatures exceeding 45°C. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 
First, the drawing will be briefly described. 



Fig. 1 

Basis of Display Of Differentially Expressed Transcripts. 



Fig. 2 

Anchor and PCR primer design. 



Fig. 3 

An autoradiogram of a DODET gel using the cellular set-up
described in Example 1; rat pheochromocytoma PC12 cells were
stimulated with Nerve Growth Factor (NGF) and Epidermal
Growth Factor (EGF).

Lanes 1-24: reverse transcription using the anchored poly T
primer 5'-T_25-AA-3'.

Lanes 25-48: reverse transcription using the anchored poly
T primer 5'-T_25-GC-3'.

Lanes 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41 and 45 represent
the PC12 cells not treated.

Lanes 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42 and 46
represent the PC12 cells treated with NGF for 60 minutes.

Lanes 3, 7, 11, 15, 19, 23, 27, 31, 35, 39, 43 and 47
represent the PC12 cells treated with NGF for 90 minutes.

Lanes 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44 and 48
represent the PC12 cells treated with EGF for 90 minutes.

Lanes 1-48, using the following primer pair for the pre-PCR
amplifications:

TaqI pre-amplification primer: 5'-CAGCATGAGTCCTGACCGA
BclI pre-amplification primer: 5'-CTCGTAGACTGCGTACCGATCA

For the second PCR amplification the following primer pairs
were used:

Lanes 1-4:    5'-CATGAGTCCTGACCGAA
              5'-GACTGCGTACCGATCAA (5' end labelling)

Lanes 5-8:    5'-CATGAGTCCTGACCGAA
              5'-GACTGCGTACCGATCAC (5' end labelling)

Lanes 9-12:   5'-CATGAGTCCTGACCGAA
              5'-GACTGCGTACCGATCAG (5' end labelling)

Lanes 13-16:  5'-CATGAGTCCTGACCGAA
              5'-GACTGCGTACCGATCAT (5' end labelling)

Lanes 17-20:  5'-CATGAGTCCTGACCGAC
              5'-GACTGCGTACCGATCAA (5' end labelling)

Lanes 21-24:  5'-CATGAGTCCTGACCGAC
              5'-GACTGCGTACCGATCAC (5' end labelling)

Lanes 25-48:  Repeated primer combinations from lanes 1-24.
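
The lane layout of Fig. 3 can be summarised as a small lookup. The following Python sketch is an illustration only and not part of the original disclosure; the function name `fig3_lane` and the dictionary keys are hypothetical, and the layout is taken directly from the legend above (two blocks of 24 lanes, six primer pairs of four lanes each, four treatments per primer pair).

```python
# Illustrative lookup for the Fig. 3 lane layout described in the legend.
RT_PRIMERS = ["5'-T25AA-3'", "5'-T25GC-3'"]          # lanes 1-24, lanes 25-48
TREATMENTS = ["untreated", "NGF 60 min", "NGF 90 min", "EGF 90 min"]
# Selective 3' bases (TaqI primer, BclI primer) for each block of four lanes.
EXTENSIONS = [("A", "A"), ("A", "C"), ("A", "G"),
              ("A", "T"), ("C", "A"), ("C", "C")]


def fig3_lane(lane):
    """Return RT primer, treatment and selective bases for a lane (1-48)."""
    if not 1 <= lane <= 48:
        raise ValueError("Fig. 3 has lanes 1-48")
    index = lane - 1
    taqi_base, bcli_base = EXTENSIONS[(index % 24) // 4]
    return {
        "rt_primer": RT_PRIMERS[index // 24],
        "treatment": TREATMENTS[index % 4],
        "taqi_primer": "5'-CATGAGTCCTGACCGA" + taqi_base,
        "bcli_primer": "5'-GACTGCGTACCGATCA" + bcli_base,
    }


print(fig3_lane(46))
```

Lane 46, for example, resolves to the 5'-T25GC-3' reverse transcription primer, NGF treatment for 60 minutes, and the sixth primer pair.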
Fig. 4a

Northern blot of RDF01 sequence from cellular total RNA.
a) PC12 cells not treated, b) NGF treatment for 60 minutes,
c) NGF treatment for 90 minutes, and d) EGF treatment for 90
minutes.

Fig. 4b

Loading control. RNA extracts were electrophoresed on a 1.2%
agarose gel containing ethidium bromide, used as a control to
determine the relative concentration of RNA in each lane; a,
b, c, d same as in Figure 4a.

Fig. 4c

Northern blot of RDF02 sequence from cellular total RNA.
a) PC12 cells not treated, b) NGF treatment for 60 minutes,
c) NGF treatment for 90 minutes, and d) EGF treatment for 90
minutes.

Fig. 4d

Loading control. RNA extracts were electrophoresed on a 1.2%
agarose gel containing ethidium bromide, used as a control to
determine the relative amount of RNA in each lane; a, b, c, d
same as in Figure 4c.

Fig. 5

Searching for genes modulated by a growth factor.

Lane 1:       Size marker in bp (150 bp, 200 bp, 250 bp).

Lanes 2-6:    Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAA

Lanes 7-11:   Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAC

Lanes 12-16:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAG

Lanes 17-21:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAT

Lanes 22-26:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GCA

In lane 11 a downregulation is observed after 6 days of
treatment, whereas in lane 16 an upregulation is observed
after 6 days of treatment. Both modulations are due to the
growth factor, since regulation is seen only when the active
growth factor is present.

Fig. 6

Searching for genes involved in bacterial resistance.

Lane 1:       Size marker in bp (150 bp, 200 bp, 250 bp,
              300 bp).

Lanes 2-5:    Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAA

Lanes 6-9:    Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAC

Lanes 10-13:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAG

Lanes 14-17:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GAT

Lanes 18-21:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GCA

Lanes 22-25:  Amplification primer 5'-Com-N_n1-3' where N_n1
              is GCC

In lanes 8-9 a downregulation is observed; in lanes 20-21 an
upregulation is observed. Both modulated genes are potentially
involved in the resistance to the Bacteriamycin, Inosin.



Fig. 7 

Principle of the technology used in Examples 4 and 5. 

After ds-cDNA synthesis the DNA is digested with one 4 base
pair endonuclease and anchors are ligated to the ds-cDNA
ends. Using specially designed primers, the expression
profiles are obtained by amplifying the mRNAs in different
expression windows (sub-fractions). The number of expression
windows depends on the complexity of the sample, e.g. 64
expression windows for a eukaryotic sample.

Fig. 8

Principles of a gene discovery DNA surface (a DNA chip).

After size separation of the DNA fragments, the DNA fragments
are transferred to a nylon membrane using an electrophoretic
principle. The membrane is hybridized with a complex DNA
probe generated using the principle of the invention.
Alternatively, the membrane can be hybridized with one single
gene to identify new members of a particular gene family. The
membrane is divided along the x coordinate into 64 expression
windows, and along the y coordinate by fragment size in base
pairs (from 50 base pairs to 1200 base pairs), according to
the principle described in Figure 7.

Fig. 9 

Principle of generating 64 pools of 3' end cDNAs.

Step 1

Production of single stranded cDNA using a 5'-Con1-T_n-V
oligonucleotide, where Con1 is an oligonucleotide of 1-100
nucleotides, n is between 5 and 40, and V is a mixture of A,
C and G.

Step 2

Double stranded cDNA is produced using a 5'-Con2-N_x
oligonucleotide, where Con2 is an oligonucleotide of 1-100
nucleotides, x is between 1 and 10, and N is a mixture of A,
C, T and G. The second strand is synthesized by Klenow enzyme
primed with the above-described oligonucleotide.

Step 3

Pre-amplification of double stranded cDNA. To amplify the
double stranded cDNA, the cDNA is PCR amplified using a
combination of Con1 and Con2 primers.

Step 4

The pre-amplified cDNA is further amplified and separated
into 64 pools using a combination of a labelled Con1 primer
and 64 Con2-NNN primers in a PCR amplification procedure,
where NNN covers the 64 different combinations of the
nucleotides A, T, G and C.

Step 5

Each of the 64 pools is separated by PAGE electrophoresis.
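
The figure of 64 pools in Step 4 follows from the three selective bases: 4 x 4 x 4 = 64. The short Python sketch below enumerates the corresponding Con2-NNN primers. It is an illustration only; the function name `selective_primers` is hypothetical, and the string "Con2-" is a placeholder for the common primer sequence, which the legend does not specify.

```python
from itertools import product


def selective_primers(common, n_selective=3):
    """Append every possible 3' extension of length n_selective to `common`."""
    return [common + "".join(bases)
            for bases in product("ACGT", repeat=n_selective)]


# "Con2-" is a placeholder for the common part of the primer.
pools = selective_primers("Con2-")
print(len(pools))          # number of pools for three selective bases
print(pools[:4])
```

With one selective base the same function yields 4 primers, and with two it yields 16, which is how the number of expression windows scales with the number of selective bases.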

EXAMPLES 

In order to verify the functionality of the invention,
examples are described below in which a developmental
eukaryotic cellular system, pheochromocytoma PC12, was
employed.

Nerve Growth Factor (NGF) induces growth arrest and neurite
outgrowth in the in vitro PC12 cell system. Other growth
factors, such as epidermal growth factor (EGF), support
survival and stimulate growth. NGF-induced genes include the
immediate early genes, which encode transcription factors
such as c-fos and c-myc. The products of the immediate early
genes are thought to be involved in regulating the expression
of genes associated with the neuronal phenotype, for example
neurofilaments, peripherin, GAP-43 and transin.

In order to identify new early genes involved in neuronal
differentiation and proliferation, the following DODET method
is used.

The following examples demonstrate how efficiently the method
of the invention can be applied to such cellular systems.

EXAMPLE 1 

The rat pheochromocytoma PC12 cells were grown (in vitro) in
the presence and absence of Nerve Growth Factor (NGF) and
epidermal growth factor (EGF) under growth conditions
described elsewhere (Saltiel et al. 1996).

The total RNA was isolated using the standard single-step
method of Chomczynski and Sacchi according to Sambrook et al.
1989.

Total RNA concentration was determined spectrophotometrically
and then adjusted to 0.2 µg/µl. This RNA was used directly in
the Northern analysis.

For DODET, 4 x 0.5 µg total RNA was reverse transcribed in
separate pools using the primer 5'-T25AA-3'. The same
procedure was performed using the 5'-T25GC-3' poly-dT
anchored primer, giving a total of 2 x 4 x 0.5 µg of RNA.



First strand synthesis

20.0 µl   total RNA (amount between 0.3 and 1.0 µg RNA)
 3.0 µl   5'-T25AA-3' (conc. 100 ng/µl) or 5'-T25GC-3'
          (conc. 100 ng/µl)
 5.0 µl   10 x cDNA buffer (buffer B from Epicentre
          Technologies # R19250)
 2.0 µl   dNTPs (25 mM, from Pharmacia Biotech)
 1.0 µl   Superscript II RT (... U/µl) (Gibco BRL # 18064-014)
 5.0 µl   Retrotherm RT (1 U/µl) (Epicentre #R19250)
14.0 µl   H2O
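
As a quick consistency check of the first strand reaction, the listed volumes can be summed and the final concentrations derived. The Python sketch below is illustrative only; the names are hypothetical, and it assumes that the stated stock concentrations (10 x buffer, 25 mM dNTPs, 100 ng/µl primer) apply as listed. Whether 25 mM refers to each dNTP or to the total is not stated in the text.

```python
# Volumes (in µl) of the first strand synthesis reaction as listed above.
COMPONENTS_UL = {
    "total RNA": 20.0,
    "anchored primer": 3.0,
    "10 x cDNA buffer": 5.0,
    "dNTPs": 2.0,
    "Superscript II RT": 1.0,
    "Retrotherm RT": 5.0,
    "H2O": 14.0,
}
TOTAL_UL = sum(COMPONENTS_UL.values())


def final_concentration(stock, volume_ul, total_ul=TOTAL_UL):
    """Dilution formula: final = stock * volume added / total volume."""
    return stock * volume_ul / total_ul


buffer_x = final_concentration(10, COMPONENTS_UL["10 x cDNA buffer"])
dntp_mm = final_concentration(25, COMPONENTS_UL["dNTPs"])
primer_ng_per_ul = final_concentration(100, COMPONENTS_UL["anchored primer"])

print(f"total volume: {TOTAL_UL} µl")
print(f"buffer: {buffer_x} x, dNTPs: {dntp_mm} mM, "
      f"primer: {primer_ng_per_ul} ng/µl")
```

The volumes add up to a 50 µl reaction in which the buffer is at 1 x, consistent with the 5 µl of 10 x buffer listed.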

To obtain high specificity, the cDNA reaction was incubated 
at 50°C for 30 minutes followed by 1 hour incubation at 70°C. 

Second strand synthesis:

To the first strand reaction, add the following components:

15.0 µl   10 x cDNA buffer
 3.0 µl   Hybridase Thermostable RNase (1 U/µl) (Epicentre
          Technologies # .12 2 050)
 1.0 µl   rBst thermostable DNA polymerase (1 U/µl)
          (Epicentre Technologies #BH1100)
81.0 µl   H2O

Incubate at 65°C for 1 hour.

The resulting double stranded cDNA was phenol extracted,
precipitated and resuspended in 20 µl of H2O. Half of this
volume was checked on a gel; if a smear between 100 bp and
3000 bp was observed, the rest of the cDNA was used for DODET
template production. The resulting cDNAs were digested with
10 U of each of the thermostable restriction enzymes TaqI and
BclI at 50°C for 2 hours. To this mixture, DODET adapters
were added and ligated to the ends of the restriction
fragments with T4 DNA ligase (1 U), resulting in the primary
template. 8-15 cycles of non-radioactive pre-amplification,
using primers complementary to the DODET adapters, were
performed on a small aliquot (1/10th volume) of the primary
template (94°C denaturation, 30 s; 56°C annealing, 30 s; 72°C
polymerisation, 1 min). The products of the amplification
(termed secondary template) were also checked on a 1.5%
agarose gel. As expected, fragment sizes were predominantly
between 100 bp and 1000 bp. All amplification reactions were
carried out on a PE-9600 thermocycler using Taq
DNA-polymerase, both from Perkin Elmer Corp. (Norwalk, CT,
USA). The final template was then diluted 10 fold with H2O.

The adapters ligated to the restriction fragments, and the
primers used for the pre-amplification and active PCR, are
given below:

TaqI adapter:       5'-CAGCATGAGTCCTGAC
                           TACTCAGGACTGGC-5'

TaqI pre-amplification primer: 5'-CAGCATGAGTCCTGACCGA

TaqI amplification primer:     5'-CATGAGTCCTGACCGAN
                               (N = A or C or G or T)

BclI adapter:       5'-CTCGTAGACTGCGTACC
                             CTGACGCATGGCTAG-5'

BclI pre-amplification primer: 5'-CTCGTAGACTGCGTACCGATCA

BclI amplification primer:     5'-GACTGCGTACCGATCAN
                               (N = A or C or G or T)

For PCR all the different combinations of one extension
(denoted as N above) were available, giving a total of 4 x 4
= 16 primer combinations. All oligonucleotides were obtained
from DNA Technology (Aarhus, Denmark).
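
The 16 primer combinations follow from pairing each of the four TaqI amplification primers with each of the four BclI amplification primers. A minimal Python sketch of this enumeration is given below for illustration; the function name `primer_pairs` and the constant names are hypothetical, while the primer sequences are those listed above.

```python
from itertools import product

# Common parts of the amplification primers, without the selective base N.
TAQI_CORE = "CATGAGTCCTGACCGA"
BCLI_CORE = "GACTGCGTACCGATCA"


def primer_pairs():
    """Return all (TaqI primer, BclI primer) pairs with one selective base each."""
    return [(TAQI_CORE + taqi_n, BCLI_CORE + bcli_n)
            for taqi_n, bcli_n in product("ACGT", repeat=2)]


pairs = primer_pairs()
print(len(pairs))
for taqi_primer, bcli_primer in pairs[:3]:
    print("5'-" + taqi_primer, "5'-" + bcli_primer)
```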

Radioactive labelling of the BclI primer was performed using
1 U of T4 polynucleotide kinase. Thermocycling was carried
out essentially as described above but with 35 cycles,
including an 11 cycle touch-down (the annealing temperature
was reduced from 65°C to 56°C in 0.7°C steps for 11 cycles
and subsequently maintained at 56°C for 23 cycles). Samples
were then boiled after the addition of dye and 50% formamide
and separated on a 5% polyacrylamide sequencing type gel
(GIBCO BRL Life Technologies Inc., Gaithersburg, MD, USA).
All gels were run at standard conditions, such that the 70 bp
marker was 3 cm from the bottom of the gel, giving good
resolution between 70 and 800 bp. Gels were then dried
directly onto Whatman 3MM paper on a slab gel dryer. Labelled
DNA fragments were visualised by autoradiography. Gels and
films were positionally marked prior to development. The 1
base selective extensions were chosen empirically to yield
approximately 50 radioactively labelled fragments per lane.
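
The touch-down profile described above can be written out cycle by cycle. The Python sketch below is one possible reading and not the authors' thermocycler program: it assumes one initial cycle at 65°C, followed by 11 cycles in which the annealing temperature drops by 0.7°C per cycle, followed by 23 cycles at 56°C, which gives the stated total of 35 cycles. Under this reading the last touch-down cycle anneals at 57.3°C before the plateau at 56°C. The function name and parameters are hypothetical.

```python
def touchdown_schedule(start=65.0, step=0.7, touchdown_cycles=11,
                       plateau=56.0, plateau_cycles=23):
    """Return the annealing temperature (°C) for each PCR cycle."""
    temps = [start]                                   # assumed initial cycle
    temps += [round(start - step * i, 1)              # touch-down cycles
              for i in range(1, touchdown_cycles + 1)]
    temps += [plateau] * plateau_cycles               # constant plateau
    return temps


schedule = touchdown_schedule()
print(len(schedule))
print(schedule[:13])
```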

Bands identified on the autoradiogram as interesting were
lined up with markings on the film and the dehydrated gel and
were excised. Excised fragments were monitored for activity.
The gel fragments were isolated using GENECLEAN (BIO101,
California, USA). DNA was then recovered according to the
manufacturer's recommendations. DNA fragments could then be
reamplified using the same PCR conditions and primers as used
in the initial PCR; however, 15 cycles generally yielded
sufficient product for cloning. Cloning was achieved using
unpurified PCR product and the vector display-p!23T (Display
Systems Biotech, USA). Conditions were used as recommended by
the manufacturer.
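
The digestion and selective amplification steps of Example 1 can be mimicked in silico to predict which primer combination would amplify a given fragment. The Python sketch below is a simplified illustration under stated assumptions and is not part of the original protocol: it uses the standard recognition sites of TaqI (T^CGA) and BclI (T^GATCA), considers only fragments flanked by one TaqI end and one BclI end, ignores adapter and primer lengths, and reports the insert length from cut to cut. All function names are hypothetical.

```python
import re

# Recognition sites; both enzymes cut after the first T of the site.
ENZYMES = {"TaqI": "TCGA", "BclI": "TGATCA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def find_sites(seq):
    """Return (site_start, enzyme) for every recognition site, sorted."""
    hits = []
    for enzyme, site in ENZYMES.items():
        hits += [(m.start(), enzyme) for m in re.finditer(f"(?={site})", seq)]
    return sorted(hits)


def mixed_fragments(seq):
    """List TaqI/BclI fragments with the selective base needed at each end."""
    seq = seq.upper()
    hits = find_sites(seq)
    fragments = []
    for (left_start, left_enz), (right_start, right_enz) in zip(hits, hits[1:]):
        if left_enz == right_enz:
            continue  # both ends cut by the same enzyme: not a mixed fragment
        left_end = left_start + len(ENZYMES[left_enz])
        # Left end: the selective base is the first base after the site.
        left_base = seq[left_end]
        # Right end: the primer reads the bottom strand, so the selective
        # base is the complement of the base preceding the site.
        right_base = seq[right_start - 1].translate(COMPLEMENT)
        extension = {left_enz: left_base, right_enz: right_base}
        fragments.append({
            "insert_length": right_start - left_start,
            "TaqI_ext": extension["TaqI"],
            "BclI_ext": extension["BclI"],
        })
    return fragments


print(mixed_fragments("AAATCGAGCCCCTTGATCAAA"))
```

For the toy sequence shown, the single mixed fragment would be amplified by the TaqI primer ending in G together with the BclI primer ending in A.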



EXAMPLE 2 



Figure 3 shows a typical DODET gel produced by amplification 
of template derived from treatment of PC12 cells with NGF or 
EGF . 

Total RNA was reverse transcribed with the 5'-T25AA-3' and
5'-T25GC-3' poly-dT anchored primers, and after anchor
ligation, pre-amplification with the BclI and TaqI
pre-amplification primer pair was performed.

6 out of 16 possible primer combinations are shown, using 1
selective base at each restriction enzyme site (Figure 3).
The largest visible products (Figure 3) are approximately
1000 bp in size and the lower end of the gel corresponds to
approximately 100 bp. In this size window an average of 50
bands can be scored for each primer combination. In Figure 3,
various expression patterns can be detected.

Due primarily to the stringent conditions possible in DODET,
resolution of the banding pattern is high while the level of
background remains acceptable (Figure 3). Furthermore, quite
radical changes in the intensity of individual bands over the
treatment period do not seem to affect the patterns of other
bands in the same lane.

It is, therefore, possible to conclude that the PCR remains
proportional and independent of the concentration of
individual substrates in the reaction.

The use of an optimised combination of the standard protocols
described above for isolating, re-amplifying and cloning
individual RDFs has allowed the identification of a number of
transcripts associated with differentiation and proliferation
events.

Four RDFs were isolated for further analysis (Figure 3, bands
a, b, c and d).

Sequence analysis revealed that RDF a = RDF b and RDF c = RDF
d, as illustrated in Figure 3.

In all cases appropriate terminal sequences with the correct
1 selective base extensions used in the PCR could be
retrieved, demonstrating the stringency and fidelity of the
system (data not shown).



EXAMPLE 3 



During scanning of the PC12 cellular systems treated with NGF
or EGF with different primer combinations, two RDFs
(designated RDF01 and RDF02) exhibiting a differential
expression during the NGF treatment were isolated (RDF01 =
RDF a = RDF b and RDF02 = RDF c = RDF d in Figure 3).

After re-amplification, sub-cloning and DNA sequencing,
further DNA analysis revealed two unknown RDFs upregulated
after 60 minutes NGF treatment or 90 minutes EGF treatment in
the PC12 cellular system. The nucleotide sequences of both
RDF01 and RDF02 show less than 10% homology to any existing
gene in the GenBank or EMBL databases.

The expression of RDF01 and RDF02 was further analyzed using
Northern blot (Figures 4a-4d). Here, transcripts could
clearly be detected at 60 minutes NGF treatment or 90 minutes
EGF treatment of the PC12 cells, confirming the results
obtained using the DODET method, as illustrated in Figure 3.

Experiments to clone the full length of RDF01 and RDF02, and
the biological characterisation of their involvement in the
differentiation and proliferation of the PC12 cellular
system, are currently in progress.

EXAMPLE 4 

Searching for genes modulated by a growth factor. 

A human cell line was treated with a growth factor and RNA
was isolated at various time points as indicated below.

1 Cell without any treatment (lanes 2, 7, 12, 17, 22)

2 Cell treated with helper agent, 1 day (lanes 3, 8, 13,
18, 23)

3 Cell treated with helper agent and growth factor, 1 day
(lanes 4, 9, 14, 19, 24)

4 Cell treated with helper agent, 6 days (lanes 5, 10,
15, 20, 25)

5 Cell treated with helper agent and growth factor, 6
days (lanes 6, 11, 16, 21, 26)

Human RNA was isolated and the gene discovery analysis was
performed essentially as described in the legend to Fig. 3.
5 out of 64 amplification primers are shown in Fig. 5, each
covering a certain portion of the mRNA pool in the human cell
line. The expression analysis was performed on an ALFexpress,
an automated fragment analyzer from Pharmacia Biotech, using
a Cy5 label.

In lane 11 a downregulation is observed after 6 days of
treatment. In lane 16 an upregulation is observed after 6
days of treatment. Both modulations are due to the growth
factor, since regulation is only observed with the active
growth factor present.
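
The relation between lane number, amplification primer and treatment in Example 4 can be made explicit with a small lookup. The Python sketch below is illustrative only; the function name `fig5_lane` is hypothetical, and it assumes that the five primer extensions listed in the legend to Fig. 5 (GAA, GAC, GAG, GAT, GCA) are loaded in that order in consecutive blocks of five lanes starting at lane 2.

```python
CONDITIONS = [
    "no treatment",
    "helper agent, 1 day",
    "helper agent and growth factor, 1 day",
    "helper agent, 6 days",
    "helper agent and growth factor, 6 days",
]
# Selective extensions N_n1 in the assumed loading order of Fig. 5.
EXTENSIONS = ["GAA", "GAC", "GAG", "GAT", "GCA"]


def fig5_lane(lane):
    """Return (selective extension, treatment) for a sample lane (2-26)."""
    if not 2 <= lane <= 26:
        raise ValueError("sample lanes are 2-26; lane 1 is the size marker")
    block, condition = divmod(lane - 2, len(CONDITIONS))
    return EXTENSIONS[block], CONDITIONS[condition]


print(fig5_lane(11))
print(fig5_lane(16))
```

Under this layout, lanes 11 and 16 both correspond to the 6 day treatment with helper agent and growth factor, in the GAC and GAG expression windows respectively.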

EXAMPLE 5 

Searching for genes involved in bacterial resistance to
antibiotics.

A Listeria monocytogenes strain was treated with the
Bacteriamycin, Inosin. RNA from a strain resistant to Inosin
was further investigated.

1. Bacterial clone 1 without any treatment (lanes 2, 6,
10, 14, 18)

2. Bacterial clone 2 without any treatment (lanes 3, 7,
11, 15, 19)

3. Bacterial clone 3 resistant to Inosin (lanes 4, 8, 12,
16, 20)

4. Bacterial clone 4 resistant to Inosin (lanes 5, 9, 13,
17, 21)

Bacterial RNA was isolated by standard techniques and the
gene discovery analysis was performed according to Example 4
and Fig. 5, with the exception that a 5'-NNNNNNYYA primer was
used for first strand synthesis.

6 of 64 amplification primers are shown in Figure 6, each
covering a certain portion of the mRNA pool in the
prokaryotic cell system. The expression analysis was
performed on an ALFexpress, an automated fragment analyzer
from Pharmacia Biotech, using a Cy5 label.

In lanes 8 and 9 a downregulation is observed and in lanes 20
and 21 an upregulation is observed. Both modulated genes are
potentially involved in the resistance to Inosin.

CLAIMS

1. A method for preparing a normalized sub-divided library of
amplified cDNA fragments from the coding region of mRNA
contained in a sample, the method comprising the steps of

a) subjecting the mRNA derived from the sample to reverse
transcription using at least one cDNA primer having the
general formula

5'-Con1-dT_n2-V_n3-N_n4-3'

wherein Con1 is any sequence between 1-100 nucleotides,
dT is deoxythymidinyl, V is A, G or C, N is A, G, C or T,
n2 is an integer ≥ 1, n3 is 0 or 1, if n3 is 0 then n4 is
0, and if n3 is 1 n4 is an integer ≥ 0, thereby obtaining
first strand cDNA fragments,

b) synthesizing second strand cDNA complementary to the
first strand cDNA fragments by use of the first strand
DNA fragments as templates, and a second cDNA primer with
the general formula

5'-Con2-N_x-3'

wherein Con2 is any sequence between 1-100 nucleotides
and can be different from or identical to Con1, N is A, G,
T or C, and x is an integer > 0, in an appropriate
enzyme/buffer solution which comprises the DNA pol I
enzyme or the Klenow fragment of the DNA pol I enzyme, all
four deoxyribonucleoside triphosphates and standard buffer
and temperature conditions, thereby obtaining double
stranded cDNA fragments, and

c) subjecting the cDNA fragments obtained in step b) to a
molecular amplification procedure so as to obtain
amplified cDNA fragments, wherein is used a set of
amplification primers having the general formula

5'-Con3-N_n1-3', I

wherein Con3 is a sequence identical to either Con1 or
Con2 or both, N is A, G, T or C, and n1 is an integer ≥
0, wherein at least one set of primers has the general
formula I where n1 > 0, said at least one set being
capable of priming amplification of any nucleotide sequence
complementary in its 5'-end to Con1 or Con2.

2. A method for preparing a normalized sub-divided library of 
amplified cDNA fragments from the coding region of mRNA 
contained in a sample, the method comprising the steps of 

a) subjecting the mRNA derived from the sample to reverse 
transcription using at least one cDNA primer, thereby 
obtaining first strand cDNA fragments, 

b) synthesizing second strand cDNA complementary to the 
first strand cDNA fragments by use of the first strand 
DNA fragments as templates, thereby obtaining double 
stranded cDNA fragments, 

c) digesting the double stranded cDNA fragments with at
least one restriction endonuclease, thereby obtaining
cleaved cDNA fragments,

d) ligating at least two adapter fragments to the cleaved
cDNA fragments obtained in step c), so as to obtain
ligated cDNA fragments, and

e) subjecting the ligated cDNA fragments obtained in step
d) to a molecular amplification procedure so as to obtain
amplified cDNA fragments, wherein is used, for an adapter
fragment used in step d), a set of amplification primers
having the general formula

5'-Com-N_n1-3', II

wherein Com is a sequence complementary to at least the
5'-end of an adapter fragment which is ligated to the
3'-end of a cleaved cDNA fragment, N is A, G, T, or C, and
n1 is an integer ≥ 0, and wherein at least one set of
primers has the general formula II where n1 > 0, said at
least one set being capable of priming amplification of
any nucleotide sequence ligated in its 3'-end to the
adapter fragment complementary in its 5'-end to Com.

3. A method according to claim 1 or 2, wherein the mRNA is of
eukaryotic, archaeal or prokaryotic origin.

4. A method according to any of claims 1-3, wherein the
reverse transcription is performed under high stringency
conditions.

5. A method according to any of the preceding claims, wherein 
the reverse transcription is carried out at a temperature in 
the range from about 45°C to about 95°C by use of an enzyme 
having reverse transcriptase activity at said temperature. 

6. A method according to claim 5, wherein the enzyme is
thermostable, such as an enzyme selected from the group
consisting of a DNA polymerase with reverse transcriptase
activity derived from thermophilic eubacteria, such as Taq
(Thermus aquaticus), Stoffel (Thermus aquaticus), Tth
(Thermus thermophilus), Tfl/Tub (Thermus flavus), Tru
(Thermus ruber), Tca (Thermus caldophilus), Tfil (Thermus
filiformis), Tbr (Thermus brockianus), Bst (B.
stearothermophilus), Bca (B. caldotenax YT-G), Bcav (B.
caldovelox YT-F), FjSS3-B.1 (Thermotoga FjSS3-B.1), Tma
(Thermotoga maritima), UITma (T. maritima), Tli (T.
litoralis), Tli exo- (T. litoralis), 9°N-7 (Thermococcus
sp.), BG-D (Pyrococcus sp.), Pfu (P. furiosus), Pwo (P.
woesei), Sac (S. acidocaldarius), Ssol (S. solfataricus), Tac
(T. acidophilum), and Mth (Methanococcus voltae).



7. A method according to any of claims 1-4, wherein the
reverse transcription is carried out at a temperature in the
range from about 25°C to about 55°C by use of an enzyme
having reverse transcriptase activity at said temperature.

8. A method according to claim 7, wherein the enzyme is a 
reverse transcriptase, such as a reverse transcriptase selec- 
ted from reverse transcriptase from AMV (Avian Myeloblastosis 
Virus), M-MuLV (murine M-MuLV pol gene), or HIV-1 (HIV 
virus) . 

9. A method according to any of claims 1-4, wherein the 
reverse transcription is carried out in two subsequent steps, 
the first step comprising carrying out reverse transcription 
as defined in claim 7 or 8, and the second step comprising
carrying out reverse transcription as defined in claim 5 or



10. A method according to claim 9, wherein reverse
transcription in the two steps is effected by non-identical
enzymes having reverse transcriptase activity.

11. A method according to claim 10, wherein the non-identical
enzymes are added separately in each step or are present in
both steps.

12. A method according to claim 11, wherein the activity of 
the enzyme which is active in the first step is substantially 
abolished in the second step. 

13. A method according to any of claims 9-12, wherein the 
enzyme effecting reverse transcription in the first step is 
reverse transcriptase from MMuLV, AMV or HIV-1 and/or the 
enzyme effecting reverse transcription in the second step is 
Tth or Taq. 



14. A method according to any of the preceding claims,
wherein the at least one cDNA primer includes an oligo or
poly dT tail in the 3' end.

15. A method according to claim 14, wherein the at least one
cDNA primer has the general formula 5'-dT_n2-V_n3-N_n4-3',
wherein dT is deoxythymidinyl, V is A, G, or C, N is A, G, C
or T, n2 is an integer ≥ 1, n3 is 0 or 1, if n3 is 0 then n4
is 0, and if n3 is 1 then n4 is an integer ≥ 0.

16. A method according to claim 15, wherein, when n3 is 1,
3 x 4^n4 groups of cDNA primers are used, each group being
distinct from any one of the other groups with respect to the
structure -V_n3-N_n4-.

17. A method according to claim 16, wherein the pool of mRNA
is subdivided into 3 x 4^n4 aliquots which are each subjected
separately to step a) utilizing one of the 3 x 4^n4 groups of
cDNA primers, thereby obtaining a subdivision of the first
strand cDNA into 3 x 4^n4 separate pools.

18. A method according to claim 16 or 17, wherein n4 is 0 or 
1 . 

19. A method according to any of claims 1-13, wherein the at
least one cDNA primer does not include a poly or oligo dT
tail in the 5'-end, or wherein at least two cDNA primers are
used of which at least one includes a poly or oligo dT tail
in the 5'-end and of which at least one second does not
include a poly or oligo dT tail in the 5'-end.

20. A method according to claim 19, wherein the cDNA primer
which does not include a poly or oligo dT tail in the 5'-end
has the following structure

5'-N_x-TTA-3' or 5'-N_x-CTA-3' or 5'-N_x-TCA-3',

wherein N is A, G, T, or C, and x is an integer 1 ≤ x ≤ 20.

21. A method according to any of the preceding claims,
wherein step b) is carried out under conditions which
minimize the formation of mismatches between nucleotides in
the first and second cDNA strands.

22. A method according to any of the preceding claims, where- 
in the at least one restriction enzyme is chosen so as to 
ensure that at least 60% of cDNA's are cleaved. 

23. A method according to any of the preceding claims compri- 
sing the use of at least one restriction endonuclease which 
upon cleavage of cDNA results in cleaved cDNA fragments 
having sticky ends. 

24. A method according to any of the preceding claims, where- 
in the at least one restriction enzyme is chosen so as to 
cleave each complete cDNA into an average of about 3 frag- 
ments . 

25. A method according to any of the preceding claims,
comprising the use of a rare 4 base cutter as at least one
restriction endonuclease, such as the 4 base cutter AciI,
AluI, BfaI, BstUI, Csp6I, DpnI, DpnII, HaeIII, HhaI, HinP1I,
HpaII, MboI, MnlI, MseI, MspI, NlaIII, RsaI, Sau3AI, TaiI,
TaqI, and Tsp509I.

26. A method according to any of the preceding claims, where- 
in one restriction enzyme is used. 

27. A method according to any of claims 1-21, which comprises
the use of a first restriction enzyme which statistically
cleaves at least 20% of complete cDNA derived from the mRNA
sample into two subfragments, and of a second restriction
enzyme which statistically cleaves at least 50% of said
subfragments into 3 further subfragments.

28. A method according to any of the preceding claims,
wherein, in step d), at least one termination fragment is
also ligated to the 3'-end of single strands of cleaved cDNA
fragments, said at least one termination fragment introducing
a block against DNA polymerization in the 5'→3' direction
setting out from the at least one termination fragment and
said at least one termination fragment being unable to anneal
to any primer of the at least two primer sets in step e)
during the molecular amplification procedure.

29. A method according to claim 28, wherein the at least one
termination fragment comprises or is a chemically modified
nucleotide sequence.

30. A method according to claim 29, wherein the chemically
modified nucleotide sequence comprises a dideoxynucleotide in
the 3'-end.

31. A method according to claim 30, wherein the
dideoxynucleotide is covalently attached to the nucleotide
strand.

32. A method according to any of the preceding claims,
wherein the ligation of adapter and/or termination fragments
to the cleaved cDNA fragments in step d) is achieved by
annealing the adapter fragments to sticky ends of the cDNA
resulting from the cleavage in step c) and subjecting the
product to the action of an enzyme having DNA ligase
activity.

33. A method according to any of the preceding claims,
wherein the at least one set of amplification primers of
formula I or II wherein n1 > 0 has n1=1, n1=2, n1=3, or n1=4.



34. A method according to any of the preceding claims,
wherein n1=0 in the at least one set of amplification primers
having formula I or II.

35. A method according to claim 34, wherein the set of
amplification primers having n1=0 in formula I or II is
labelled.

36. A method according to any of the preceding claims,
wherein the set of amplification primers having formula I or
II wherein n1>0 comprises all possible combinations and
permutations of A, G, T and C in the group N_n1.

37. A method according to any of the preceding claims, where- 
in the ligated cDNA fragments are sub-divided into a number 
of pools prior to the molecular amplification in step e) , 
each pool being subjected to the amplification using a subset 
of the set of amplification primers. 

38. A method according to claim 36, wherein the subset of
amplification primers used for each pool comprises a primer
as defined in claim 35.

39. A method according to any of the preceding claims, which 
comprises the use of one amplification primer as defined in 
claim 35, and of one set of primers as defined in claim 36. 

40. A method according to claim 39, wherein the ligated cDNA 
fragments of step d) are subdivided into 4 nl pools which are 
each subjected separately to step e) wherein is used one 
amplification primer as defined in claim 35 and one primer 
from the set of amplification primers as defined in claim 36, 
said one primer being distinct from any one of the primers 
used for amplifying any of the other pools. 

41. A method according to any of the preceding claims, which 
comprises the further step of separating amplified fragments 
obtained from the molecular amplification procedure. 

42. A method according to claim 41, wherein the separation is 
performed by gel electrophoresis or chromatography. 

43. A method according to claim 41 or 42, which further 
comprises the step of identifying separated amplified frag- 
ments . 



44. A method according to claim 43, wherein the
identification is performed by visualization.

45. A method according to claim 44, wherein labelled
nucleotides are visualized, the labelled nucleotides being
part of a probe or of the amplified fragments.

46. A method according to claim 45, wherein the labelled
nucleotides are the labelled nucleotides being part of the
labelled primers as defined in claim 35.

47. A method according to claim 46, wherein the visualization
is performed by incorporating radioactive or fluorescent
alpha dNTP into the cDNA fragment during PCR, where N = A, C,
T, U or G.

46. A method for determining the presence of an expression 
product in a cell or group of cells, the method comprising 
providing an RNA- containing sample from the cell or group of 
cells and subjecting the sample to the method according to 
any of claims, 1-47, and thereafter performing a comparison of 
the thus identified amplified cDNA fragments with a database 
output, said database output comprising a computer-generated 
list of molecular weights of restriction DNA fragments of 
known sequences, said list being prepared by 

inputting and storing DNA sequence data in a database as 
virtual DNA sequences, 

subsequently simulating cleavage of the virtual DNA 
sequences with the at least one restriction nuclease and 
storing the resulting simulated cleavage products as 
virtually cleaved DNA fragments, 

simulating ligation to the virtually cleaved fragments of
the at least two adapter fragments and storing the results
as virtually ligated DNA fragments,

for each individual combination of primers used in step
e), grouping the virtually ligated DNA fragments susceptible
to amplification by said combination of primers in
the same group,

determining, in each group, the absolute and/or relative
molecular weight of each virtually ligated DNA fragment,
and

outputting the content of each group in the form of a
list comprising the absolute and/or relative molecular
weights of the virtually ligated fragments in the group.
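
The list-preparation steps recited above (storing virtual sequences, simulated cleavage, simulated adapter ligation, grouping by primer combination, and molecular weight determination) may be illustrated by the following minimal Python sketch. It is an illustration under stated assumptions only and is not taken from the application: the recognition site "GATC", the cut position, the adapter sequences, the single-base selective primer extensions, the function names and the average mass of 650 Da per base pair are all hypothetical choices. As a further simplification, both selective extensions are matched against the top strand, whereas a real reverse primer would be matched via its reverse complement.

```python
from collections import defaultdict

# Assumed average mass of one DNA base pair in daltons (rough approximation).
AVG_BP_MASS = 650.0


def internal_fragments(seq, site, cut_offset):
    """Simulate cleavage and return the fragments flanked by two cut sites.

    `cut_offset` is the position of the cut within the recognition site.
    """
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    return [seq[a:b] for a, b in zip(cuts, cuts[1:])]


def build_fragment_table(records, site, cut_offset, left, right, primer_pairs):
    """Group virtually ligated fragments by the primer pair that would amplify them."""
    groups = defaultdict(list)
    head = len(site) - cut_offset  # site remnant at the 5' end of each insert
    for name, seq in records.items():
        for insert in internal_fragments(seq, site, cut_offset):
            ligated = left + insert + right  # simulated adapter ligation
            # Sequence between the two site remnants; selective bases pair here.
            core = insert[head:len(insert) - cut_offset]
            for fwd, rev in primer_pairs:
                if core.startswith(fwd) and core.endswith(rev):
                    size = len(ligated)
                    groups[(fwd, rev)].append((name, size, size * AVG_BP_MASS))
    # Within each group, list fragments by increasing size.
    return {pair: sorted(rows, key=lambda r: r[1]) for pair, rows in groups.items()}


# Hypothetical input: one stored virtual sequence and two primer combinations.
records = {"geneA": "AAGATCTTTGGGCCGATCAATTGATCCC"}
table = build_fragment_table(
    records, "GATC", 0, "ACGTACGTAC", "TGCATGCATG", [("T", "C"), ("A", "T")]
)
for pair, rows in table.items():
    print(pair, rows)
```

Each group in the resulting table corresponds to one primer combination and lists the origin, length and approximate molecular weight of every virtual fragment that the combination would be expected to amplify.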

49. A method according to claim 48, wherein the input DNA
sequence data are linked to data relating to the genetic 
origin of the DNA sequence data and optionally to data relat- 
ing to functional features relating to the genetic origin. 

50. A method according to claim 48, wherein the output indication
further comprises information about the genetic origin
of the virtually ligated DNA fragment and optionally information
about functional features associated with the genetic
origin.

51. A method according to any of claims 48-50, wherein the
comparison is performed by inputting the identified amplified 
cDNA fragments in a format which allows automated comparison 
with the database output, or, alternatively, by outputting 
the database output in a format which allows for direct 
comparison between the separated amplified cDNA fragments and 
the database output. 

52. A method for determining change in expression, compared 
to the expression in a reference cell or reference group of 
cells, of an expression product in a cell or group of cells 
which has been subjected to a first set of conditions in- 
fluencing the expression pattern of said cell or group of 
cells, said reference cell or group of cells being subjected 
to a second set of conditions, the method comprising providing
an RNA-containing sample from the cell or group of
cells and subjecting the sample to the method according to
any of claims 43-47 thereby obtaining data describing the
amplified cDNA fragments derived from the sample, providing
reference data describing amplified cDNA fragments derived
from an RNA-containing reference sample from the reference
cell or reference group of cells, the reference data being 
obtained by having previously subjected the reference sample 
to the method according to any of claims 43-47, 
subsequently performing a comparison of the data and the 
reference data to identify those cDNA fragments which are 
expressed at different levels in the two data sets, and 
thereafter using the differentially expressed cDNA fragments 
to determine which expression products are subject to a 
change in expression level. 

53. A method according to claim 52, wherein the data and
reference data are selected from the group consisting of the
apparent molecular weights of the amplified DNA fragments,
the M_r of the amplified DNA fragments, the absolute amount of
the amplified DNA fragments, and the relative amounts of the
amplified DNA fragments.
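
The comparison of sample data with reference data recited in claims 52 and 53 may be sketched as follows. This is a minimal illustration under assumptions that do not appear in the application: each data set is represented as a mapping from apparent fragment size to relative amount, fragments are matched by identical size, a two-fold change is taken as the threshold for differential expression, and a small floor value stands in for fragments that are absent from one data set. The function name is hypothetical.

```python
import math


def differential_fragments(sample, reference, min_fold=2.0, floor=1e-9):
    """Return fragments whose relative amount differs by at least `min_fold`.

    `sample` and `reference` map apparent fragment size (bp) to relative amount.
    """
    changed = {}
    for size in sorted(set(sample) | set(reference)):
        s = sample.get(size, 0.0)
        r = reference.get(size, 0.0)
        # The floor avoids division by zero for fragments seen in one set only.
        log2_ratio = math.log2((s + floor) / (r + floor))
        if abs(log2_ratio) >= math.log2(min_fold):
            changed[size] = "up" if log2_ratio > 0 else "down"
    return changed


# Hypothetical relative amounts for a treated sample and its reference.
sample = {120: 0.30, 215: 0.10, 340: 0.05}
reference = {120: 0.10, 215: 0.11, 340: 0.20, 410: 0.02}
print(differential_fragments(sample, reference))
```

The fragments flagged in this way would then be traced back to their expression products, for example by means of a database output of the kind defined in claim 48.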

54. A method according to claim 52, wherein the reference 
data are extracted from a database containing the reference 
data defined in claim 53 and optionally further information 
relating to the genetic origin of each amplified cDNA frag- 
ment from the reference. 

55. A method for diagnosing a disease in a subject, said 
disease being characterized by a deviating expression level 
of at least one expression product in at least one cell type, 
the method comprising providing an RNA-containing sample
derived from the at least one cell type, subjecting the
sample to the method according to any of claims 43-47 thereby
obtaining data describing the amplified cDNA fragments derived
from the sample, providing reference data describing
amplified cDNA fragments derived from an RNA-containing reference
sample derived from the same type of cell from a subject
not suffering from the disease, the reference data being
obtained by having previously subjected the reference sample
to the method according to any of claims 43-47, and
subsequently performing a comparison of the data and the
reference data with respect to those cDNA fragments which are
known to be related to the disease, and assessing whether a
significant difference in the data and reference data exists
so as to establish whether the expression level of the expression
product deviates or not.

56. A method according to claim 55, wherein the data and
reference data are selected from the group consisting of the
apparent molecular weights of the amplified DNA fragments,
the M_r of the amplified DNA fragments, the absolute amount of
the amplified DNA fragments, and the relative amounts of the
amplified DNA fragments.

57. A method according to claim 55, wherein the reference
data are extracted from a database containing the reference
data defined in claim 56 and optionally further information
relating to the genetic origin of each amplified cDNA fragment
from the reference.

58. A method of synthesizing first strand cDNA, the method
comprising subjecting a sample comprising mRNA to reverse
transcription wherein, in a first step performed at a temperature
not exceeding 55°C, a first enzyme is used having a
substantial reverse transcriptase activity at said temperature
not exceeding 55°C, and, in a subsequent second step
performed at an elevated temperature in the range of 45°C -
95°C, a second enzyme is used having a substantial reverse
transcriptase activity at said elevated temperature, said
first enzyme being substantially inactive.
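
The temperature limits of the two-step reverse transcription of claim 58 may be expressed as a simple check on a planned incubation program. The following Python sketch is offered only as an illustration: the class and function names are hypothetical, and the requirement that the second step be warmer than the first is an assumed reading of the term "elevated temperature" rather than a limit stated in the claim.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RTStep:
    enzyme: str           # free-text label, e.g. "non-thermostable RT"
    temperature_c: float  # incubation temperature in degrees Celsius


def check_two_step_program(first, second):
    """Return a list of deviations from the claimed temperature windows."""
    problems = []
    if first.temperature_c > 55:
        problems.append("first step exceeds 55 °C")
    if not 45 <= second.temperature_c <= 95:
        problems.append("second step outside 45-95 °C")
    # Assumed interpretation: "elevated" means above the first-step temperature.
    if second.temperature_c <= first.temperature_c:
        problems.append("second step is not at an elevated temperature")
    return problems


# Hypothetical program: 42 °C for the first enzyme, 70 °C for the second.
program = (
    RTStep("non-thermostable RT", 42),
    RTStep("thermostable polymerase with RT activity", 70),
)
print(check_two_step_program(*program))
```

An empty list indicates that the planned temperatures fall within the windows recited in the claim.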

59. A method according to claim 58, wherein both enzymes are
present in both steps.



60. A method according to claim [illegible], wherein said first enzyme
has a substantially higher activity than said second enzyme
in catalyzing reverse transcription at said temperature not
exceeding 55°C.

61. A method according to claim 59 or 60, wherein said second
enzyme has a substantially higher activity than said first
enzyme in catalyzing reverse transcription at said elevated
temperature.

62. A method according to any of claims 58-61, wherein said
first enzyme is selected from the group consisting of
non-thermostable reverse transcriptases and wherein said second
enzyme is selected from the group consisting of thermostable
DNA polymerases with reverse transcriptase activities.

63. A composition for use in reverse transcription of RNA,
the composition comprising

a) a first enzyme having reverse transcriptase activity
at temperatures not exceeding 55°C, and

b) a second enzyme having reverse transcriptase activity
at elevated temperatures in the range of 45°C - 95°C,

said second enzyme having a substantially higher activity
than said first enzyme in catalyzing reverse transcription at
said elevated temperatures.

64. A composition according to claim 63, wherein said first
enzyme has a substantially higher activity than said second
enzyme in catalyzing reverse transcription at said temperatures
not exceeding 55°C.

65. Use of a thermostable enzyme having reverse transcriptase
activity in the preparation of a composition for use in
reverse transcription of RNA which has previously been in
vitro reverse transcribed by another enzyme having reverse
transcriptase activity.

66. A method of preparing a surface (chip) coated with cDNA
fragments, the method comprising

subjecting an RNA-containing sample to the method of any
of claims 41-47, and

transferring the separated amplified cDNA fragments to a
chip surface adapted to stably bind the separated amplified
cDNA fragments while maintaining the spatial relative
distribution pattern thereof.

67. A method of preparing a surface (chip) coated with cDNA 
fragments, the method comprising 

subjecting an RNA-containing sample to the method of any
of claims 1-40,

separating, by electrophoresis, the thus obtained amplified
cDNA fragments on a particular surface adapted to
stably bind the separated amplified cDNA fragments while
maintaining the relative distribution pattern after
electrophoresis.

68. A method according to claim 67, wherein the electrophore- 
sis is in the form of microelectrophoresis. 

69. A method according to any of claims 66-68, wherein the
transfer is accomplished by an electrophoretic blotting
technique.

70. A method according to any of claims 66-68, wherein the
transfer is accomplished by photo-activated organic or inor- 
ganic chemistry techniques. 

71. A surface having cDNA stably bound thereto, said surface
being obtainable by the method according to any of claims
66-70.

72. A method for the screening for genes within a family of
genes, the method comprising providing a surface according to
claim 71, wherein cDNA stably bound to the surface is hybridized
under low stringency conditions to a detectably labelled
nucleic acid which is a representative of a gene family,
and subsequently analyzing fragments of the chip to which
hybridization has occurred so as to determine whether such
fragments are related to the same gene family.

73. A method for determining the difference in expression
pattern between a first cell or type of cells and a second
cell or type of cells, the method comprising providing samples
of labelled RNA or cDNA from the first and second cells
or cell types and subsequently contacting each of these
samples with a surface according to claim 71, and subsequently
detecting the amount and distribution of bound labelled
RNA or cDNA from each sample.

74. A method for screening for interactions between a
pre-selected protein and a polypeptide fragment, the method
comprising preparing a sub-divided library of amplified cDNA
fragments according to the method of any of claims 1-47,
optionally adapting the terminals of the members of the
library so as to facilitate insertion into a vector, inserting
the fragments into vectors, transforming a population of
suitable host cells with the vectors, culturing the host
cells under conditions which enable expression of correctly
inserted cDNA fragments by the host cell, and subsequently
assaying polypeptide fragments encoded by the inserted cDNA
fragments for interaction with the pre-selected protein.

75. A method according to claim 74, wherein assaying of the
polypeptide fragments is performed by a two-hybrid technique,
wherein the host cells are eukaryotic cells which are mated
or transfected with nucleic acid material encoding the
pre-selected protein, successful mating/transfection of the
cell(s) resulting in a cell or cells wherein the interaction
between the pre-selected protein and a polypeptide fragment
gives rise to a detectable signal.

76. A method according to claim 75, wherein a fungus, such as
a yeast cell, is used in the mating/transfection.

77. A method according to claim 75 or 76, wherein the detectable
signal is provided by Green Fluorescent Protein.

This International Searching Authority found multiple (groups of)
inventions in this international application, as follows:

1. Claims: 1-57,66-77 

A method for preparing a normalized sub-divided library of
amplified cDNA fragments from the coding region of mRNA
contained in a sample, the method comprising the steps of
a) subjecting the mRNA derived from the sample to reverse
transcription using at least one cDNA primer, b)
synthesizing second strand cDNA complementary to the first
strand cDNA fragments by use of the first strand DNA
fragments as templates, and a second primer and c)
subjecting the cDNA fragments obtained in step b) to a
molecular amplification procedure so as to obtain amplified
cDNA fragments, wherein a set of amplification primers is
used; a method for preparing a normalized sub-divided
library of amplified cDNA fragments from the coding region
of mRNA contained in a sample, the method comprising the
steps of a) subjecting the mRNA derived from the sample to
reverse transcription using at least one cDNA primer, b)
synthesizing second strand cDNA complementary to the first
strand cDNA fragments by use of the first strand DNA
fragments as templates, thereby obtaining double stranded
cDNA fragments, c) digesting the double stranded cDNA
fragments with at least one restriction endonuclease,
thereby obtaining cleaved cDNA fragments, d) ligating at
least two adapter fragments to the cleaved cDNA fragments
obtained in step c), so as to obtain ligated cDNA fragments,
and e) subjecting the ligated cDNA fragments in step d) to a
molecular amplification procedure so as to obtain amplified
cDNA fragments, wherein, for an adapter fragment used in
step d), a set of amplification primers is used; a method
for determining the presence of an expression product in a
cell or a group of cells using said method; a method for
determining change in expression, compared to the expression
in a reference cell or reference group of cells; a method of
diagnosing a disease in a subject; a method of preparing a
surface (chip) coated with cDNA fragments; a method for
screening for genes within a family of genes; a method for
screening for interactions between a pre-selected protein
and a polypeptide fragment; said method wherein assaying of
the polypeptide fragments is performed by a two-hybrid
technique;

2. Claims: 58-65 

A method of synthesizing first strand cDNA, the method
comprising subjecting a sample comprising mRNA to reverse
transcription wherein, in a first step performed at a
temperature not exceeding 55°C, a first enzyme is used
having a substantial reverse transcriptase activity at a
temperature not exceeding 55°C, and, in a subsequent second
step performed at an elevated temperature in the range of
45°C - 95°C, a second enzyme is used having a substantial
reverse transcriptase activity at said elevated temperature,
said first enzyme being substantially inactive; a composition
for use in reverse transcription of RNA, the composition
comprising a) a first enzyme having reverse transcriptase
activity at temperatures not exceeding 55°C, b) a second
enzyme having reverse transcriptase activity at elevated
temperatures in the range of 45°C - 95°C, said first enzyme
has substantially higher activity than said second enzyme in
catalyzing reverse transcription at said temperature not
exceeding 55°C; use of thermostable enzyme having reverse
transcriptase activity in the preparation of a composition
for use in reverse transcription of RNA which has previously
been in vitro reverse transcribed by another enzyme having
reverse transcriptase activity;
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